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CRYSTALLIZED PPARa LIGAND BINDING DOMAIN POLYPEPTIDE AND 

SCREENING METHODS EMPLOYING SAME 

Technical Field 

The present invention relates generally to the structure of the ligand 
binding domain of PPARa, and more particularly to the structure of the ligand 
binding domain of PPARa in complex with a ligand. The invention further 
relates to methods by which modulators and ligands of PPARa and other 
PPARs can be identified. 



Abbreviations 

ATP adenosine triphosphate 

ADP adenosine diphosphate 

BSA bovine serum albumin 

cDNA complementary DNA 

DBD DNA binding domain 

DMSO dimethyl sulfoxide 

DNA deoxyribonucleic acid 

DTT dithiothreitol 

EDTA ethylenediaminetetraacetic acid 

HEPES N-2-Hydroxyethylpiperazine-N-2-ethanesulfonic 

acid 

kDa kilodalton(s) 

LBD ligand binding domain 

mPPAR mouse peroxisome proliferator activated receptor 

NDP nucleotide diphosphate 

NTP nucleotide triphosphate 

PAGE polyacrylamide gel electrophoresis 

PCR polymerase chain reaction 

pi isoelectric point 

PPAR peroxisome proliferator-activated receptor 

PPARa peroxide proliferator-activated receptor alpha 
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PPRE PPAR response element 

rPPAR rat peroxisome proliferator activated receptor 

RXR retinoid X receptor 

SDS sodium dodecyl sulfate 

SDS-PAGE sodium dodecyl sulfate polyacrylamide gel 

electrophoresis 
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Background Art 

Nuclear receptors reside in either the cytoplasm or nucleus of 
30 eukaryotic cells and represent a superfamily of proteins that specifically bind a 
physiologically relevant small molecule, such as a hormone or vitamin. As a 
result of a molecule binding to a nuclear receptor, the nuclear receptor 
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changes the ability of a cell to transcribe DNA, i.e. nuclear receptors modulate 
the transcription of DNA. However, they can also have transcription 
independent actions. 

Unlike integral membrane receptors and membrane-associated 
receptors, nuclear receptors reside in either the cytoplasm or nucleus of 
eukaryotic cells. Thus, nuclear receptors comprise a class of intracellular, 
soluble, ligand-regulated transcription factors. Nuclear receptors include but 
are not limited to receptors for glucocorticoids, androgens, mineralcorticoids, 
progestins, estrogens, thyroid hormones, vitamin D, retinoids, icosanoids and 
pertinently, peroxisome proliferators. Many nuclear receptors, identified by 
either sequence homology to known receptors (See, e.g. , Drewes et al. , 
(1996) Mol. Cell. Biol. 16:925-31) or based on their affinity for specific DNA 
binding sites in gene promoters (See, e.g. , Sladek et al. , Genes Dev. 4:2353- 
65), have unascertained ligands and are therefore termed "orphan receptors". 

Peroxisomes are organelles that are involved in the p-oxidation of long- 
chain fatty acids and the catabolism of cholesterol to bile acids (See, e.g. , 
Vamecq & Draye , (1989) Essays Biochem. 24: 115-225). Peroxisome 
proliferators are a structurally diverse group of compounds which, when 
administered to rodents, elicit dramatic increases in the size and number of 
hepatic and renal peroxisomes, as well as concomitant increases in the 
capacity of peroxisomes to metabolize fatty acids via increased expression of 
the enzymes required for the p-oxidation cycle (Lazarow & Fujiki , (1985) Ann. 
Rev. Cell Biol. 1: 489-530; Vamecq & Draye , (1989) Essays Biochem. 24: 
115-225; and Nelali et al. , (1988) Cancer Res. 48: 5316-5324). Chemicals of 
this group include the fibrate class of hypolipidemic drugs, herbicides, 
phthalate plasticizers, unsaturated fatty acids, and leukotriene antagonists 
(reviewed in Green , (1992) Biochem. Pharmacol. 43: 393-401). Peroxisome 
proliferation can also be elicited by dietary or physiological factors such as a 
high-fat diet and cold acclimatization. 

Peroxisome proliferator activated receptors (PPARs) are activated by 
one of a group of compounds known as peroxisome proliferators. Insight into 
the mechanism by which peroxisome proliferators exert their pleiotropic 
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effects was provided by the identification of a member of the nuclear hormone 
receptor superfamily activated by the chemicals described above ( Isseman & 
Green , (1990) Nature 347: 645-50). This receptor, termed peroxisome 
proliferator activated receptor alpha (PPARa), was subsequently shown to be 
5 activated by a variety of medium and long-chain fatty acids. PPARa was also 
shown to modulate expression of a variety of genes containing one or more 
PPAR responsive elements found in their promoter regions. 

It appears that PPARa has a role in the regulation of virtually the entire 
oxidative pathway of fatty acids and their derivatives ( See , Lemberger et al. , 

10 (1996) Ann. Rev. Cell. Dev. Biol. 12: 335-63). It has also been observed that 
PPARa expression is closely tied to conditions that induce elevated 
glucocorticoid levels such as fasting, diurnal rhythm (Lemberger et al. , (1996) 
J. Biol. Chem. 271 : 1 764-69) and stress. 

Structurally, PPARa comprises three functional domains, the N 

15 terminus region, the DNA binding domain and the ligand binding domain. 
These domains retain their functional autonomy when they are expressed as 
a chimeric or fusion protein ( Gottlicher et al. , (1992) Proc. Natl. Acad. Sci. 
U.S.A. 89: 4653-57). 

PPARa activates transcription by binding to DNA sequence elements, 

t 

20 termed PPAR response elements (PPRE), as heterodimers (dimerization is 
essential for the activity of PPARs) with the retinoid X receptors (RXR) ( See , 
Keller & Whali , (1993) Trends Endocrin. Met. 4: 291-96), which are 
themselves activated by 9-cis retinoic acid ( See , Kliewer et al. , (1992) Nature 
. 358: 771-74; Gearing et al. , (1993) Proc. Natl. Acad. Sci. U.S.A. 90: 1440-44; 

25 Keller et aL , (1993) Proc. Natl. Acad. Sci. U.S.A. 90: 2160-2164; Heyman et 
aL, (1992) Cell 68: 397-406 and Levin et al. , (1992) Nature 355: 359-61). 
Since the PPARct-RXR complex can be activated by peroxisome proliferators 
and/or 9-cis retinoic acid, the retinoid and fatty acid signaling pathways are 
seen to converge in modulating lipid metabolism. 

30 PPREs have been identified in the enhancers of a number of genes 

that encode proteins that regulate lipid metabolism, including: (1) the three 
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enzymes required for peroxisomal p-oxidation of fatty acids; (2) medium-chain 
acyl-CoA dehydrogenase, a key enzyme in mitochondrial p-oxidation; and (3) 
aP2, a lipid binding protein expressed exclusively in adipocytes. Thus, the 
nature of the PPAR target genes coupled with the activation of PPARs by fatty 
5 acids and hypolipidemic drugs suggests a physiological role for the PPARs in 

* 

a variety of physiological phenomena, including lipid homeostasis ( See, e.g. , 
Keller & Whali , (1993) Trends Endocrin. Met 4: 291-96). PPARs have also 
been implicated in glucose homeostasis disorders and in atherosclerosis. 
These conditions may exist alone or together in a complex phenotype of 

1 0 metabolic disorders known as syndrome X. 

Since the discovery of PPARa, additional subtypes of PPAR have been 
identified, e.g. PPARy and PPARS, which are spatially differentially expressed. 
Because there are several subtypes of PPAR, it is desirable to identify 
compounds that are capable of selectively interacting with only one of the 

15 PPAR subtypes, notably PPARa. Compounds capable of interacting with 
PPARa exclusively would find a wide variety of applications, for example, in 
the prevention of obesity, for the treatment of diabetes, and other deleterious 
conditions, as noted above. Development of such compounds, however, has 
been hindered by a lack of detailed structural information on the ligand 

20 binding domain of PPARa and particularly by a lack of structural information 
on the conformation of the ligand binding domain of PPARa as it binds a 
modulating compound. 

It is believed that PPARa regulates some of the same genes as PPARy 
and PPAR8. However, some genes might be upregulated by one PPAR and 

25 downregulated by another PPAR. Up- or down-regulation of certain genes by 
a PPARy agonist might cause detrimental side effects. It might be possible to 
use a PPARa or PPAR5 agonist, partial agonist or antagonist to down- or up- 
regulate (respectively) these same genes, and thereby reduce the detrimental 
side-effects. More generally, it might be possible to individually up- and 

30 down-regulate specific genes to achieve a specific therapeutic goal by 
administering a PPAR activator (or partial activator) that activates (or 



WO 02/064632 



PCT/US02/03278 



-7- 

deactivates) each PPAR to the appropriate extent. Design .or discovery of 
such a compound would be greatly facilitated by three-dimensional structures 
for each of the three target receptors, PPARa, PPARy and PPAR8. 

Polypeptides, including the ligand binding domain of PPARa, have a 
5 three-dimensional structure determined by the primary amino acid sequence 
and the environment surrounding the polypeptide. This three-dimensional 
structure establishes the polypeptide's activity, stability, binding affinity, 
binding specificity, and other biochemical attributes. Thus, knowledge of a 
protein's three-dimensional structure can provide much guidance in designing 

10 agents that mimic, inhibit, or improve its biological activity. 

The three-dimensional structure of a polypeptide can be determined in 
a number of ways. Many of the most precise methods employ X-ray 
crystallography (See, e.g. , Van Holde , (1971) Physical Biochemistry , Prentice- 
Hall, N. J., 221-39). This technique relies on the ability of crystalline lattices to 

15 diffract X-rays or other forms of radiation. Diffraction experiments suitable for 
determining the three-dimensional structure of macromolecules typically 
require high-quality crystals. Unfortunately, such crystals have been 
unavailable for the ligand binding domain of PPARa, as well as many other 
proteins of interest. Thus, high-quality diffracting crystals of the ligand binding 

20 domain of PPARa in complex with a ligand would greatly assist in the 
elucidation of its three-dimensional structure. 

■ 

Clearly, the solved crystal structure of the PPARa ligand binding 
domain polypeptide would be useful in the design of modulators of activity 
mediated by all of the PPARs. Evaluation of the available sequence data has 

25 made it clear that PPARa shares significant sequence homology with the 
other PPARs. Further, PPARa shows structural homology with the three- 
dimensional fold of the other PPARs. Thus, in theory, it might be considered 
feasible to design modulators of PPARa based exclusively on the sequence 
and three-dimensional fold of a different PPAR, for example, PPARy. This 

30 method, however, would likely be unproductive and certainly hindered by a 
lack of subtle structural details of the various binding sites and pertinent 
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residues of PPARa involved in the binding event. A solved crystal structure 
would provide these structural details. 

The solved PPARa-ligand crystal structure would provide structural 
details and insights necessary to design a modulator of PPARa that 
maximizes preferred requirements for any modulator, i.e. potency and 
specificity. By exploiting the structural details obtained from a PPAR-ligand 
crystal structure, it would be possible to design a PPAR modulator that, 
despite PPARa's similarity with other PPARs, exploits the unique structural 
features of PPARa. A PPAR modulator developed using structure-assisted 
design would take advantage of heretofore unknown PPAR structural 
considerations and thus be more effective than a modulator developed using 
homology-based design. Potential or existent homology models cannot 
provide the necessary degree of specificity. A PPAR modulator designed 
using the structural coordinates of a crystalline form of PPARa would also 
provide a starting point for the development of modulators of other PPARs. 

What is needed, therefore, is a crystallized form of a PPARa ligand 
binding domain, preferably in complex with a ligand. Acquisition of crystals of 
the PPARa ligand binding domain (LBD) polypeptide will permit the three 
dimensional structure of PPARa LBD polypeptide to be determined. 
Knowledge of the three dimensional structure will facilitate the design of 
modulators of PPARa activity. Such modulators can lead to therapeutic 
compounds to treat a wide range of conditions, including lipid homeostasis 
disorders, glucose homeostasis disorders, inflammation, atherosclerosis and 
syndrome X. 

Summary of the Invention 
A substantially pure PPARa ligand binding domain polypeptide in 
crystalline form is disclosed. Preferably, the crystalline form has lattice 
constants of a = 61 .3 A, b = 103.5 A, c = 49.9 A, a = 90°, (5 = 90°, y = 90° or 
lattice constants of a = 95.58 A, b = 122.06 A, c = 122.10 A, a = 90°, p = 90°, 
y = 90°. Preferably, the crystalline form is an orthorhombic crystalline form. 
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More preferably, the crystalline form has a space group of P2 i 2 1 2 or space 
group P2i2i2i. Even more preferably, the PPARct ligand binding domain 
polypeptide has the amino acid sequence shown in SEQ ID NO: 4. Even 
more preferably, the PPARa ligand binding domain has a crystalline structure 
5 further characterized by the coordinates corresponding to Table 2. 

Preferably, the PPARa ligand binding domain polypeptide is in complex 
with a ligand. Optionally, the crystalline form contains one or four PPARa 
ligand binding domain polypeptides in the asymmetric unit. Preferably, the 
crystalline form is such that the three-dimensional structure of the crystallized 

10 PPARa ligand binding domain polypeptide can be determined to a resolution 
of about 1 .8 A or better. Even more preferably, the crystalline form contains 
one or more atoms having a molecular weight of 40 grams/mol or greater. 

A method for determining the three-dimensional structure of a 
crystallized PPARa ligand binding domain polypeptide to a resolution of about 

15 1 .8 A or better is disclosed. The method comprises (a) crystallizing a PPARa 
ligand binding domain polypeptide; and (b) analyzing the PPARa ligand 
binding domain polypeptide to determine the three-dimensional structure of 
the crystallized PPARa ligand binding domain polypeptide, whereby the three- 
dimensional structure of a crystallized PPARa ligand binding domain 

20 polypeptide is determined to a resolution of about 1 .8 A or better. Preferably, 
the analyzing is by X-ray diffraction. More preferably, the crystallization is 
accomplished by the hanging drop vapor diffusion method, and wherein the 
PPARa ligand binding domain is mixed with an equal volume of reservoir. 
Even more preferably, the reservoir comprises 4-8% PEG 3350, 100-200mM 

25 NaF, and 12-16% 2,5 hexanediol or the reservoir comprises 50 mM bis-tris- 
propane, 4-6% PEG 3350, 150 mM NaN0 3 , 16% 2,5 hexanediol, and 1-3 mM 
YCI 3 . 

A method of designing a modulator of a PPAR polypeptide is disclosed. 
The method comprises (a) designing a potential modulator of a PPAR 
30 polypeptide that will form bonds with amino acids in a substrate binding site 
based upon a crystalline structure of a PPARa ligand binding domain 
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polypeptide; (b) synthesizing the modulator; and (c) determining whether the 
potential modulator modulates the activity of the PPAR polypeptide, whereby 
a modulator of a PPAR polypeptide is designed. 

A method of designing a modulator that selectively modulates the 
5 activity of a PPAR polypeptide is disclosed. The method comprises (a) 
obtaining a crystalline form of a PPARa ligand binding domain polypeptide; 
(b) evaluating the three-dimensional structure of the crystallized PPARa 
ligand binding domain polypeptide; and (c) synthesizing a potential modulator 

* 

based on the three-dimensional crystal structure of the crystallized PPARa 

10 ligand binding domain polypeptide, whereby a modulator that selectively 
modulates the activity of a PPARa polypeptide is designed. Preferably, the 
method further comprises contacting a PPARa ligand binding domain 
polypeptide with the potential modulator; and assaying the PPARa ligand 
binding domain polypeptide for binding of the potential modulator, for a 

1 5 change in activity of the PPARa ligand binding domain polypeptide, or both. 
More preferably, the crystalline form is in orthorhombic form. Even more 
preferably, the crystals are such that the three-dimensional structure of the 
crystallized PPARa ligand binding domain polypeptide can be determined to a 
resolution of about 1 .8 A or better. 

20 A method of screening a plurality of compounds for a modulator of a 

PPAR ligand binding domain polypeptide is disclosed. The method comprises 
(a) providing a library of test samples; (b) contacting a crystalline PPARa 
ligand binding domain polypeptide with each test sample; (c) detecting an 
interaction between a test sample and the crystalline PPARa ligand binding 

25 domain polypeptide; (d) identifying a test sample that interacts with the 
crystalline PPARa ligand binding domain polypeptide; and (e) isolating a test 
sample that interacts with the crystalline PPARa ligand binding domain 
polypeptide, whereby a plurality of compounds is screened for a modulator of 
a PPAR ligand binding domain polypeptide. Preferably, the test samples are 

30 bound to a substrate, and more preferably, the test samples are synthesized 
directly on a substrate. 
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A method for identifying a PPAR modulator is disclosed. The method 
comprises (a) providing atomic coordinates of a PPARa ligand binding 
domain to a computerized modeling system; and (b) modeling ligands that fit 
spatially into the binding pocket of the PPARa ligand binding domain to 
5 thereby identify a PPAR modulator. Preferably, the method further comprises 
identifying in an assay for PPAR-mediated activity a modeled ligand that 
increases or decreases the activity of the PPAR. 

A method of identifying a PPARa modulator that selectively modulates 
the activity of a PPARa polypeptide compared to other polypeptides is 

10 disclosed. The method comprises (a) providing atomic coordinates of a 
PPARa ligand binding domain to a computerized modeling system; and (b) 
modeling a ligand that fits into the binding pocket of a PPARa ligand binding 
domain and that interacts with conformationally constrained residues of a 
PPARa conserved among PPAR subtypes to thereby identify a PPARa 

15 modulator. Preferably, the method further comprises identifying in a biological 
assay for PPARa activity a modeled ligand that selectively binds to PPARa 
and increases or decreases the activity of said PPARa. 

A method of designing a modulator of a PPAR polypeptide is disclosed. 
The method comprises (a) selecting a candidate PPAR ligand; (b) determining 

20 which amino acid or amino acids of a PPAR polypeptide interact with the 
ligand using a three-dimensional model of a crystallized protein comprising a 
PPARa LBD; (c) identifying in a biological assay for PPAR activity a degree to 
which the ligand modulates the activity of the PPAR polypeptide; (d) selecting 
a chemical modification of the ligand wherein the interaction between the 

25 amino acids of the PPAR polypeptide and the ligand is predicted to be 
modulated by the chemical modification; (e) performing the chemical 
modification on the ligand to form a modified ligand; (f) contacting the 
modified ligand with the PPAR polypeptide; (g) identifying in a biological 
assay for PPAR activity a degree to which the modified ligand modulates the 

30 biological activity of the PPAR polypeptide; and (h) comparing the biological 
activity of the PPAR polypeptide in the presence of modified ligand with the 



WO 02/064632 



PCT/US02/03278 



-12- 

biological activity of the PPAR polypeptide in the presence of the unmodified 
ligand, whereby a modulator of a PPAR polypeptide is designed. Preferably, 
the PPAR polypeptide is a PPARa polypeptide. More preferably, the three- 
dimensional model of a crystallized protein is a PPARa LBD polypeptide with 
5 a bound ligand. Optionally, the method further comprises repeating steps (a) 
through (f), if the biological activity of the PPAR polypeptide in the presence of 
the modified ligand varies from the biological activity of the PPAR polypeptide 
in the presence of the unmodified ligand. 

An assay method for identifying a compound that inhibits binding of a 

10 ligand to a PPAR polypeptide is disclosed. The assay method comprises (a) 
incubating a PPAR polypeptide with a ligand in the presence of a test inhibitor 
compound; (b) determining an amount of ligand that is bound to the PPAR 
polypeptide, wherein decreased binding of ligand to the PPAR protein in the 
presence of the test inhibitor compound relative to binding of ligand in the 

15 absence of the test inhibitor compound is indicative of inhibition; and (c) 
identifying the test compound as an inhibitor of ligand binding if decreased 
ligand binding is observed. 

A method of identifying a PPAR modulator that selectively modulates 
the biological activity of one PPAR subtype compared to PPARa is disclosed. 

20 The method comprises: (a) providing an atomic structure coordinate set 
describing a PPARa ligand binding domain structure and at least one other 
atomic structure coordinate set describing a PPAR ligand binding domain, 
each ligand binding domain comprising a ligand binding site; (b) comparing 
the PPAR atomic structure coordinate sets to identify at least one diference 

25 between the sets; (c) designing a candidate ligand predicted to interact with 
the difference of step (b); (d) synthesizing the candidate ligand; and (e) testing 
the synthesized candidate ligand for an ability to selectively modulate a PPAR 
subtype as compared to PPARa, whereby a PPAR modulator that selectively 
modulates the biological activity of one PPAR subtype compared to PPARa is 

30 identified. 

Accordingly, it is an object of the present invention to provide a three 
dimensional structure of the ligand binding domain of PPARa. The object is 
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achieved in whole or in part by the present invention. 

An object of the invention having been stated hereinabove, other 
objects will be evident as the description proceeds, when taken in connection 
with the accompanying Drawings and Laboratory Examples as best described 

« 

5 hereinbelow. 



Brief Description of the Drawings 
10 Figure 1 is a ribbon diagram depicting the PPARot LBD in complex with . 

Compound 1. The PPARa LBD is presented as a ribbon diagram and 
Compound 1 is presented as a spacefilling model. 

Figure 2 is a schematic drawing depicting interactions between PPARot 
and Compound 1 . Residues that lie within 5.5A of heavy atoms in the ligand 
15 are shown. 

Figure 3 depicts overlayed ball and stick diagrams of PPARa, PPARy 
and PPAR5. PPARa is shown in dark gray, PPARy in medium gray and 
PPAR5 in light gray. PPAR5 was used as a template in the molecular 
replacement solution of PPARa. 
20 Figure 4 is a ribbon diagram of the PPARa backbone conformation, 

where the ligand binding pocket of PPARa is identified by a smooth solid 
surface. 

Figure 5 is a ribbon diagram of the PPARy backbone conformation, 
where the ligand binding pocket of PPARy is identified by a smooth solid 
25 surface. 

Figure 6 is a ribbon diagram of the PPAR8 backbone conformation, 
where the ligand binding pocket of PPAR8 is identified by a smooth solid 
surface. 

Figure 7 is a ball-and-stick model depicting hydrogen bonding 
30 interactions between Compound 1 and PPARa. Water molecules are shown 
as octahedral (six-pointed) crosses. 



WO 02/064632 



PCT7US02/03278 



-14- 

Detailed Description of the Invention 
Until disclosure of the present invention presented herein, the ability to 
obtain crystalline forms of a PPARa LBD has not been realized. And until 
5 disclosure of the present invention presented herein, a detailed three- 
dimensional crystal structure of a PPARa polypeptide has not been solved. 

In addition to providing structural information, crystalline polypeptides 
provide other advantages. For example, the crystallization process itself 
further purifies the polypeptide, and satisfies one of the classical criteria for 

10 homogeneity. In fact, crystallization frequently provides unparalleled 
purification quality, removing impurities that are not removed by other 
purification methods such as HPLC, dialysis, conventional column 
chromatography, etc. Moreover, crystalline polypeptides are often stable at 
ambient temperatures and free of protease contamination and other 

15 degradation associated with solution storage. Crystalline polypeptides can 
also be useful as pharmaceutical preparations. Finally, crystallization 
techniques in general are largely free of problems such as denaturation 
associated with other stabilization methods (e.g., lyophilization). Once 
crystallization has been accomplished, crystallographic data provides useful 

20 structural information that can assist the design of compounds that can serve 
as agonists or antagonists, as described herein below. In addition, the crystal 
structure provides information useful to map a receptor binding domain, which 

4 

could then be mimicked by a small non-peptide molecule that would serve as 
an antagonist or agonist. 

25 

L Definitions 

Following long-standing patent law convention, the terms "a" and "an" 
mean "one or more" when used in this application, including the claims. 

As used herein, the term "mutation" carries its traditional connotation 
30 and means a change, inherited, naturally occurring or introduced, in a nucleic 
acid or polypeptide sequence, and is used in its sense as generally known to 
those of skill in the art. 
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As used herein, the term "labeled" means the attachment of a moiety, 
capable of detection by spectroscopic, radiologic or other methods, to a probe 
molecule. 

As used herein, the term "target cell" refers to a cell, into which it is 

5 desired to insert a nucleic acid sequence or polypeptide, or to otherwise effect 
a modification from conditions known to be standard in the unmodified cell. A 
nucleic acid sequence introduced into a target cell can be of variable length. 
Additionally, a nucleic acid sequence can enter a target cell as a component 
of a plasmid or other vector or as a naked sequence. 

10 As used herein, the term "transcription" means a cellular process 

involving the interaction of an RNA polymerase with a gene that directs the 
expression as RNA of the structural information present in the coding 
sequences of the gene. The process includes, but is not limited to the 
following steps: (a) the transcription initiation, (b) transcript elongation, (c) 

15 transcript splicing, (d) transcript capping, (e) transcript termination, (f) 
transcript polyadenylation, (g) nuclear export of the transcript, (h) transcript 
editing, and (i) stabilizing the transcript. 

As used herein, the term "expression" generally refers to the cellular 
processes by which a biologically active polypeptide is produced. 

20 As used herein, the term "transcription factor" means a cytoplasmic or 

nuclear protein which binds to such gene, or binds to an RNA transcript of 
such gene, or binds to another protein which binds to such gene or such RNA 
transcript or another protein which in turn binds to such gene or such RNA 
transcript, so as to thereby modulate expression of the gene. Such 

25 modulation can additionally be achieved by other mechanisms; the essence of 
"transcription factor for a gene" is that the level of transcription of the gene is 
altered in some way. 

As used herein, the term "hybridization" means the binding of a probe 
molecule, a molecule to which a detectable moiety has been bound, to a 

30 target sample. 

As used herein, the term "detecting" means confirming the presence of 
a target entity by observing the occurrence of a detectable signal, such as a 
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radiologic or spectroscopic signal that will appear exclusively in the presence 
of the target entity. 

As used herein, the term "sequencing" means the determining the 
ordered linear sequence of nucleic acids or amino acids of a DNA or protein 
5 target sample, using conventional manual or automated laboratory 
techniques. 

As used herein, the term "isolated" means oligonucleotides 
substantially free of other nucleic acids, proteins, lipids, carbohydrates or 
other materials with which they can be associated, such association being 

10 either in cellular material or in a synthesis medium. The term can also be 
applied to polypeptides, in which case the polypeptide will be substantially 
free of nucleic acids, carbohydrates, lipids and other undesired polypeptides. 

As used herein, the term "substantially pure" means that the 
polynucleotide or polypeptide is substantially free of the sequences and 

15 molecules with which it is associated in its natural state, and those molecules 
used in the isolation procedure. The term "substantially free" means that the 
sample is at least 50%, preferably at least 70%, more preferably 80% and 
most preferably 90% free of the materials and compounds with which is it 
associated in nature. 

20 As used herein, the term "primer* means a sequence comprising two or 

more deoxyribonucleotides or ribonucleotides, preferably more than three, 
and more preferably more than eight and most preferably at least about 20 
nucleotides of an exonic or intronic region. Such oligonucleotides are 
preferably between ten and thirty bases in length. 

25 As used herein, the term "gene" is used for simplicity to refer to a 

functional protein, polypeptide or peptide encoding unit. As will be understood 
by those in the art, this functional term includes both genomic sequences and 
cDNA sequences. Preferred embodiments of genomic and cDNA sequences 
are disclosed herein. 

30 As used herein, the term "DNA segment" means a DNA molecule that 

has been isolated free of total genomic DNA of a particular species. In a 
preferred embodiment, a DNA segment encoding a PPARa polypeptide refers 
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to a DNA segment that comprises SEQ ID NOs: 1 and 3, but can optionally 
comprise fewer or additional nucleic acids, yet is isolated away from, or 
purified free from, total genomic DNA of a source species, such as Homo 
sapiens. Included within the term "DNA segment" are DNA segments and 
5 smaller fragments of such segments, and also recombinant vectors, including, 
for example, plasmids, cosmids, phages, viruses, and the like. 

As used herein, the phrase "enhancer-promoter" means a composite 
unit that contains both enhancer and promoter elements. An enhancer- 
promoter is operatively linked to a coding sequence that encodes at least one 

10 gene product. 

As used herein, the phrase "operatively linked" means that an 
enhancer-promoter is connected to a coding sequence in such a way that the 
transcription of that coding sequence is controlled and regulated by that 
enhancer-promoter. Techniques for operatively linking an enhancer-promoter 

15 to a coding sequence are well known in the art; the precise orientation and 
location relative to a coding sequence of interest is dependent, inter alia, upon 
the specific nature of the enhancer-promoter. 

As used herein, the terms "candidate substance" and "candidate 
compound" are used interchangeably and refer to a substance that is believed 

20 to interact with another moiety, for example a given ligand that is believed to 
interact with a complete, or a fragment of, a PPAR polypeptide, and which can 
be subsequently evaluated for such an interaction. Representative candidate 
substances or compounds include xenobiotics such as drugs and other 
therapeutic agents, carcinogens and environmental pollutants, natural 

25 products and extracts, as well as endobiotics such as steroids, fatty acids and 
prostaglandins. Other examples of candidate compounds that can be 
investigated using the methods of the present invention include, but are not 
restricted to, agonists and antagonists of a PPAR polypeptide, toxins and 
venoms, viral epitopes, hormones (e.g., opioid peptides, steroids, etc.), 

30 hormone receptors, peptides, enzymes, enzyme substrates, co-factors, 
lectins, sugars, oligonucleotides or nucleic acids, oligosaccharides, proteins, 
small molecules and monoclonal antibodies. 
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As used herein, the term "biological activity" means any observable 
effect flowing from interaction between a PPAR polypeptide and a ligand. 
Representative, but non-limiting, examples of biological activity in the context 
of the present invention include dimerization of PPARa with RXR, 
5 phosphorylation, and association of PPARa with DNA. 

As used herein, the term "modified" means an alteration from an 
entity's normally occurring state. An entity can be modified by removing 
discrete chemical units or by adding discrete chemical units. The term 
"modified" encompasses detectable labels as well as those entities added as 
10 aids in purification. 

As used herein, the terms "structure coordinates" and "structural 
coordinates" mean mathematical coordinates derived from mathematical 
equations related to the patterns obtained on diffraction of a monochromatic 
beam of X-rays by the atoms (scattering centers) of a molecule in crystal 
15 form. The diffraction data are used to calculate an electron density map of the 
repeating unit of the crystal. The electron density maps are used to establish 
the positions of the individual atoms within the unit cell of the crystal. 

Those of skill in the art understand that a set of structure coordinates 
determined by X-ray crystallography is not without standard error. For the 
20 purpose of this invention, any set of structure coordinates for PPARa or a 
' PPARa mutant that have a root mean square (RMS) deviation of no more 
than 1.0 A when superimposed, using the polypeptide backbone atoms, on 
the structure coordinates listed in Table 2 shall be considered identical. 

As used herein, the term "space group" means the arrangement of 
25 symmetry elements of a crystal. 

As used herein, the term "molecular replacement" means a method 
that involves generating a preliminary model of the wild-type PPARa ligand 
binding domain, or a PPARa mutant crystal whose structure coordinates are 
unknown, by orienting and positioning a molecule whose structure 
30 coordinates are known (e.g., PPAR8) within the unit cell of the unknown 
crystal so as best to account for the observed diffraction pattern of the 
unknown crystal. Phases can then be calculated from this model and 
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combined with the observed amplitudes to give an approximate Fourier 
synthesis of the structure whose coordinates are unknown. This, in turn, can 
be subject to any of the several forms of refinement to provide a final, 
accurate structure of the unknown crystal. (Lattman, (1985) Method Enzymol., 
5 115: 55-77; Rossmann , ed, (1972) The Molecular Replacement Method , 
Gordon & Breach, New York.) Using the structure coordinates of the ligand 
binding domain of PPARa provided by this invention, molecular replacement 
can be used to determine the structure coordinates of a crystalline mutant or 
homologue of the PPARa ligand binding domain, or of a different crystal form 

1 0 of the PPARa ligand binding domain. 

As used herein, the terms "p-sheet" and "beta-sheet" mean the 
conformation of a polypeptide chain stretched into an extended zig-zig 
conformation. Portions of polypeptide chains that run "parallel" all run in the 
same direction. Polypeptide chains that are "antiparallel" run in the opposite 

1 5 direction from the parallel chains. 

As used herein, the terms "a-helix" and "alpha-helix" mean the 
conformation of a polypeptide chain wherein the polypeptide backbone is 
wound around the long axis of the molecule in a left-handed or right-handed 
direction, and the R groups of the amino acids protrude outward from the 

20 helical backbone, wherein the repeating unit of the structure is a single turnoff 
the helix, which extends about 0.56 nm along the long axis. 

As used herein, the term "unit cell" means a basic parallelipiped 
shaped block. The entire volume of a crystal can be constructed by regular 
assembly of such blocks. Each unit cell comprises a complete representation 

25 of the unit of pattern, the repetition of which builds up the crystal. Thus, the 
term "unit cell" means the fundamental portion of a crystal structure that is 
repeated infinitely by translation in three dimensions. A unit cell is 
characterized by three vectors a, b, and c, not located in one plane, which 
form the edges of a parallelepiped. Angles a, p and y define the angles 

30 between the vectors: angle a is the angle between vectors b and c; angle p is 
the angle between vectors a and c; and angle y is the angle between vectors 
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a and b. The entire volume of a crystal can be constructed by regular 
assembly of unit cells; each unit cell comprises a complete representation of 
the unit of pattern, the repetition of which builds up the crystal. 

As used herein, "orthorhombic unit cell" means a unit cell wherein 
5 a * b * c; and a = p = y = 90°. The vectors a, b and c describe the unit cell 
edges and the angles a, p, and y describe the unit cell angles. 

As used herein, the term "crystal lattice" means the array of points 
defined by the vertices of packed unit cells. 

As used herein, the term "PPAR" means any polypeptide sequence 

10 that can be aligned with at least one of human PPARct, PPARy or PPAR5 
such that at least 50% of the amino acids are identical to the corresponding 
amino acid in the human PPARa, PPARy or PPAR8. The term "PPAR* also 
encompasses nucleic acid sequences where the corresponding translated 
protein sequence can be considered to be a PPAR. The term "PPAR" 

15 encompasses at least the PPARa, PPARy and PPAR8 subtypes. The term 
"PPAR" includes invertebrate homologs; preferably, PPAR nucleic acids and 
polypeptides are isolated from eukaryotic sources. "PPAR" further includes 
vertebrate homologs of PPAR family members, including, but not limited to, 
mammalian and avian homolog. Representative mammalian homologs of 

20 PPAR family members include, but are not limited to, murine and human 
homologs. 

As used herein, the terms "PPARa gene product", "PPARa protein", 
"PPARa polypeptide", and "PPARa peptide" are used interchangeably and 
mean peptides having amino acid sequences which are substantially identical 

25 to native amino acid sequences from the organism of interest and which are 
biologically active in that they comprise all or a part of the amino acid 
sequence of a PPARa polypeptide, or cross-react with antibodies raised 
against a PPARa polypeptide, or retain all or some of the biological activity 
(e.g., DNA or ligand binding ability and/or dimerization ability) of the native 

30 amino acid sequence or protein. Such biological activity can include 
immunogenicity. 
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ln the present invention, the terms "PPARct gene product" , "PPARa 
protein", "PPARa polypeptide", and "PPARa peptide" are used 
interchangeably and mean to the preferred subtype of the PPAR family, 
namely PPARa, which comprises the amino acid sequence of SEQ ID NO: 2. 
5 As used herein, the terms "PPARa gene product", "PPARa protein", 

"PPARa polypeptide", and "PPARa peptide" also include analogs of a PPARa 

polypeptide. By "analogs is intended that a DNA or peptide sequence can 
contain alterations relative to the sequences disclosed herein, yet retain all or 
some of the biological activity of those sequences. Analogs can be derived 

10 from genomic nucleotide sequences as are disclosed herein or from other 
organisms, or can be created synthetically. Those skilled in the art will 
appreciate that other analogs, as yet undisclosed or undiscovered, can be 
used to design and/or construct PPARa analogs. There is no need for a 
"PPARa gene product", "PPARa protein", "PPARa polypeptide", or "PPARa 

15 peptide" to comprise all or substantially all of the amino acid sequence of a 
PPARa polypeptide gene product. Shorter or longer sequences are 
anticipated to be of use in the invention; shorter sequences are herein 
referred to as "segments". Thus, the terms "PPARa gene product", "PPARa 
protein", "PPARa polypeptide", and "PPARa peptide" also include fusion or 

20 recombinant PPARa polypeptides and proteins comprising sequences of the 
present invention. Methods of preparing such proteins are disclosed herein 
and are known in the art. 

As used herein, the term "polypeptide" means any polymer comprising 
any of the 20 protein amino acids, regardless of its size. Although "protein" is 

25 often used in reference to relatively large polypeptides, and "peptide" is often 
used in reference to small polypeptides, usage of these terms in the art 
overlaps and varies. The term "polypeptide" as used herein refers to 
peptides, polypeptides and proteins, unless otherwise noted. As used herein, 
the terms "protein", "polypeptide" and "peptide" are used interchangeably 

30 herein when referring to a gene product. 
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As used herein, the term "modulate" means an increase, decrease, or 
other alteration of any or all chemical and biological activities or properties of 
a wild-type or mutant PPAR polypeptide, preferably a wild-type or mutant 
PPARa polypeptide. The term "modulation" as used herein refers to both 
5 upregulation (i.e., activation or stimulation) and downregulation (i.e. inhibition 
or suppression) of a response. 

As used herein, the terms "binding pocket of the PPARa ligand binding 
domain", "PPARa ligand binding pocket" and "PPARa binding pocket" are 
used interchangeably, and refer to the large cavity within the PPARa ligand 

10 binding domain where Compound 1 binds. This cavity may be empty, or may 
contain water molecules or other molecules from the solvent, or may contain 
ligand atoms. The "main" binding pocket includes the region of space not 
occupied by atoms of PPARa that comprises residues lle-241, Leu-247, Ala- 
250, Glu-251 , Leu-254, Val-255, lle-272, Phe-273, Cys-275, Cys-276, Gin- 

15 277, Thr-279, Ser-280, Tyr-314, lle-317, Phe-318, Leu-321 , Met-330, Val-332, 
Ala-333, lle-339, Leu-344, lle-354, Met-355, His- 440, Val-444, Leu-456, Leu- 
460 and Tyr-464. The binding pocket also includes regions of space near 
the "main" binding pocket that not occupied by atoms of PPARa but that are 
near the "main" binding pocket, and that are contiguous with the "main" 

20 binding pocket. 

As used herein, the terms "PPAR gene" and "recombinant PPAR gene" 
mean a nucleic acid molecule comprising an open reading frame encoding a 
PPAR polypeptide of the present invention, including both exon and 
(optionally) intron sequences. 

25 As used herein, the term "gene" is used for simplicity to refer to a 

functional protein, polypeptide or peptide encoding unit. As will be understood 
by those in the art, this functional term includes both genomic sequences and 
cDNA sequences. Preferred embodiments of genomic and cDNA sequences 
are disclosed herein. 

30 As used herein, the term "DNA sequence encoding a PPAR 

polypeptide" can refer to one or more coding sequences within a particular 
individual. Moreover, certain differences in nucleotide sequences can exist 
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between individual organisms, which are called alleles. It is possible that 
such allelic differences might or might not result in differences in amino acid 
sequence of the encoded polypeptide yet still encode a protein with the same 
biological activity. As is well known, genes for a particular polypeptide can 
5 exist in single or multiple copies within the genome of an individual. Such 
duplicate genes can be identical or can have certain modifications, including 
nucleotide substitutions, additions or deletions, all of which still code for 
polypeptides having substantially the same activity. 

As used herein, the term "intron" means a DNA sequence present in a 

1 0 given gene that is not translated into protein. 

As used herein, the term "interact" means detectable interactions 
between molecules, such as can be detected using, for example, a yeast two 
hybrid assay. The term "interact' is also meant to include "binding" 
interactions between molecules. Interactions can, for example, be protein- 

1 5 protein or protein-nucleic acid in nature. 

As used herein, the terms "cells," "host cells" or "recombinant host 
ceils" are used interchangeably and mean not only to the particular subject 
cell, but also to the progeny or potential progeny of such a cell. Because 
certain modifications can occur in succeeding generations due to either 

20 mutation or environmental influences, such progeny might not, in fact, be 
identical to the parent cell, but are still included within the scope of the term as 
used herein. 

As used herein, the term "agonist" means an agent that supplements 
or potentiates the bioactivity of a functional PPAR gene or protein or of a 

25 polypeptide encoded by a gene that is up- or down-regulated by a PPAR 
polypeptide and/or a polypeptide encoded by a gene that contains a PPAR 
binding site or response element in its promoter region. 

As used herein, the term "antagonist" means an agent that decreases 
or inhibits the bioactivity of a functional PPAR gene or protein, or that 

30 supplements or potentiates the bioactivity of a naturally occurring or 
engineered non-functional PPAR gene or protein. Alternatively, an antagonist 
can decrease or inhibit the bioactivity of a functional gene or polypeptide 
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encoded by a gene that is up- or down-regulated by a PPAR polypeptide 
and/or contains a PPAR binding site or response element in its promoter 
region. An antagonist can also supplement or potentiate the bioactivity of a 
naturally occurring or engineered non-functional gene or polypeptide encoded 
5 by a gene that is up- or down-regulated by a PPAR polypeptide, and/or 
contains a PPAR binding site or response element in its promoter region. 

As used herein, the terms "chimeric protein" or "fusion protein" are 
used interchangeably and mean a fusion of a first amino acid sequence 
encoding a PPAR polypeptide with a second amino acid sequence defining a 

10 polypeptide domain foreign to, and not homologous with, any domain of a 
PPAR polypeptide. A chimeric protein can include a foreign domain that is 
found in an organism that also expresses the first protein, or it can be an 
"interspecies" or "intergenic" fusion of protein structures expressed by 
different kinds of organisms. In general, a fusion protein can be represented 

1 5 by the general formula X— PPAR— Y, wherein PPAR represents a portion of 
the protein which is derived from a PPAR polypeptide, and X and Y are 
independently absent or represent amino acid sequences which are not 
related to a PPAR sequence in an organism, which includes naturally 
occurring mutants. 

20 

IL Description of Tables 

Table 1 is a table summarizing the crystal and data statistics obtained 
from the crystallized ligand binding domain of PPARa. Data on the unit cell 
are presented, including data on the crystal space group, unit cell dimensions, 
25 molecules per asymmetric cell and crystal resolution. 

Table 2 is a table of the atomic structure coordinate data obtained from 
X-ray diffraction from the ligand binding domain of PPARa in complex with a 
ligand. 

Table 3 is a table of the atomic structure coordinate data obtained from 
30 X-ray diffraction from the ligand binding domain (residues 207-441) of a 
PPARS crystal ( Xu et al. , (1999) Mol. Cell 3: 397-403, PDB ID: 1GWX; 
Genbank Accession No. L07592; available online at http://www.rcsb.org/pdb/ ). 
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The coordinate data from the PPAR5 ligand binding domain were used in the 
molecular replacement solution of the PPARa ligand binding domain crystal 
form. 

Table 4 is a sequence alignment which shows sequence similarities 
5 between the PPARa, PPARy and PPAR8 sequences. The binding site 
residues are denoted by small boxes and represent those residues lying 
within 5.0 angstroms of the ligand. 

III. General Considerations 

10 The present invention will usually be applicable mutatis mutandis to all 

PPARs, as discussed herein based, in part, on the patterns of PPAR structure 
and modulation that have emerged as a consequence of determining the 
three dimensional structure of PPARa with bound ligand. Analysis and 
alignment of amino acid sequences, and X-ray and NMR structure 

15 determinations, have shown that nuclear receptors have a modular 
architecture with three main domains: 

1 ) a variable amino-terminal domain; 

2) a highly conserved DNA-binding domain (DBD); and 

3) a less conserved carboxy-terminal ligand binding domain (LBD). 

20 In addition, nuclear receptors may have linker segments of variable length 
between these major domains. Sequence analysis and X-ray crystallography, 
including the work of the present invention, have confirmed that PPARs also 
have the same general modular architecture, with the same three domains. 
The function of the PPARs in human cells presumably requires all three 

25 domains in a single amino acid sequence. However, the modularity of the 
PPARs permits different domains of each protein to separately accomplish 
certain functions. Some of the functions of a domain within the full-length 
receptor are preserved when that particular domain is isolated from the 
remainder of the protein. Using conventional protein chemistry techniques, a 

30 modular domain can sometimes be separated from the parent protein. Using 
conventional molecular biology techniques, each domain can usually be 
separately expressed with its original function intact or, as discussed herein 
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below, chimeras comprising two different proteins can be constructed, 
wherein the chimeras retain the properties of the individual functional domains 
of the respective nuclear receptors from which the chimeras were generated. 
The amino terminal domain of the PPAR subtypes is the least 
5 conserved of the three domains. This domain is involved in transcriptional 
activation and, in some cases, its uniqueness may dictate selective receptor- 
DNA binding and activation of target genes by PPAR subtypes. This domain 
can display synergistic and antagonistic interactions with the domains of the 
LBD. 

10 The DNA binding domain has the most highly conserved amino acid 

sequence amongst the PPARs. It typically contains about 70 amino acids that 
fold into two zinc finger motifs, wherein a zinc atom coordinates four 
cysteines. The DBD contains two perpendicularly oriented a-helixes that 
extend from the base of the first and second zinc fingers. The two zinc fingers 

15 function in concert along with non-zinc finger residues to direct the PPAR to 
specific target sites on DNA and to align receptor heterodimer interfaces. 
Various amino acids in the DBD influence spacing between two half-sites 
(which usually comprises six nucleotides) for receptor heterodimerization. 
The optimal spacings facilitate cooperative interactions between DBDs, and D 

20 box residues are part of the dimerization interface. Other regions of the DBD 
facilitate DNA-protein and protein-protein interactions required for RXR-PPAR 
heterodimerization. 

The LBD is the second most highly conserved domain in these 
receptors. As its name suggest, the LBD binds ligands. With many nuclear 

25 receptors, including the PPARs, binding of the ligand can induce a 
conformational change in the LBD that can, in turn, activate transcription of 
certain target genes. The LBD also participates in other functions, including 
dimerization and nuclear translocation. 

X-ray structures have shown that most nuclear receptor LBDs adopt 

30 the same general folding pattern. This fold consists of 10-12 alpha-helices 
arranged in a bundle, together with several beta-strands, additional alpha- 
helices and linking segments. The major alpha helices and beta-strands 
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have been numbered differently in different publications. This patent will 
follow the numbering scheme of Nolte et aL , ( Nolte et al. y (1998) Nature 
395:137-43), where the major alpha-helices and beta-strands are designated 
sequentially through the amino acid sequence as H1, H2, S1, H2\ H3, H3\ 
5 H4, H5, S2. S3, S4, H6, H7, H8, H9, H10 and HAF. The alpha-helix at the C- 
terminal end, HAF, is also called "helix-AF", "helix-AF2" or the "AF2 helix". 
Structural studies have shown that most of the alpha-helices and beta-strands 
have the same general position and orientation in all nuclear receptor 
structures, whether ligand is bound or not. However, the AF2 helix has been 

10 found in different positions and orientations relative to the main bundle, 
depending on the presence or absence of the ligand, and also on the 
chemical nature of the ligand. These structural studies have suggested that 
many nuclear receptors share a common mechanism of activation, where 
binding of activating ligands helps to stabilize the AF2 helix in a position and 

15 orientation adjacent to helices-3, -4, and -10, covering an opening to the 
ligand binding site. This position and orientation of the AF2 helix, which will 
be called the "active conformation", creates a binding site for coactivators. 
See, e.g. , Nolte et al„ (1998) Nature 395:137-43; Shiau et al„ (1998) Ce//95: 
927-37. This coactivator binding site has a central lipophilic pocket that can 

20 accommodate leucine side-chains from coactivators, as well as a "charge- 
clamp" structure consisting primarily of a lysine residue from helix-3 and a 
glutamic acid residue from the AF2 helix. Structural studies have shown that 
coactivator peptides containing the sequence LXXLL (where L is leucine and 
X can be a different amino acid in different cases) can bind to this coactivator 

25 binding site by making interactions with the charge clamp lysine and glutamic 
acid residues, as well as the central lipophilic region. This coactivator binding 
site is disrupted when the AF2 helix is shifted into other positions and 
orientations. In PPAR^y, activating ligands such as rosiglitazone (BRL49653) 
make a hydrogen bonding interaction with tyrosine-473 in the AF2 helix. 

30 Nolte et al. , (1998) Nature 395:137-43; Gampe et aL , (2000) Mol. Cell 5: 545- 
55. This interaction is believed to stabilize the AF2 helix in the active 
conformation, thereby allowing coactivators to bind and thus activating 
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transcription from target genes. With certain antagonist ligands, or in the 
absence of any ligand, the AF2 helix may be held less tightly in the active 
conformation, or may be free to adopt other conformations. This would either 
destabilize or disrupt the coactivator binding site, thereby reducing or 
5 eliminating coactivator binding and transcription from certain target genes. 
Some of the functions of the PPAR protein depend on having the full-length 
amino acid sequence and certain partner molecules, such as coactivators and 
DNA. However, other functions, including ligand binding and ligand- 
dependent conformational changes, may be observed experimentally using 
10 isolated domains, chimeras and mutant molecules. 

As described herein, the LBD of a PPAR can be expressed, 
crystallized, its three dimensional structure determined with a ligand bound as 
disclosed in the present invention, and computational methods can be used to 

r 

design ligands to its LBD. 

15 

IV. Synthesis of Compound 1 and Intermediates 

Compound 1, which was co-crystallized with the PPARa LBD in the 
present invention, can be conveniently prepared by a general process 
wherein a moiety like (A) is coupled to an acid (B) using a peptide coupling 
20 reaction or by alkylation of (A) using a suitable non nucleophilic amine with an 
acid chloride (C). Preferably, R is 1-6 alkyl, which can be hydrolyzed off or, is 
readily hydrolyzable. 

A B C 

A preferred synthesis of (A) when X, is O and X 2 is NH (and R 1 and R* 
25 are H) is: 

HO^V^ DMF/K 2 CO,/80-C J OW* jf 
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15 



Note that this synthesis is preferably carried out with the amine where 
the alcohol function is already alkylated with the acid side chain protected by 
R. For example, when n is1 , Xi is O, X 2 is NH, Y is S, Z is N, R 1 and R 2 are H, 
and R 3 is 4-F 3 C-phenyl : 



XT 



NH HO 
+ 




HOBT/EDC / NEtj 
DMF/rt 



ROOX^ 





CH 2 CI 2 / rt ^^P^q^K^ 




Some of the intermediates of type A are commercially available while 
10 others can be synthesized by techniques apparent to a person skilled in the 
art. The synthesis of intermediates of type B is illustrated below. 

Compound 1 can be made by an alternative method in which 
compounds of formula (D) are reacted with ethyl 2-bromo-2 methyl propionate 
to produce an ethyl ester, which may be hydrolyzed to produce the free acid. 

O 




20 



Compounds of formula (D) may be prepared from the reaction between 
compounds of formula (B) and compounds of formula (E) with HOBT / EDC / 
NEt 3 when X 2 is NH or NCH 3 or DIC / DMAP / NEta when X 2 is O. 
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The invention is further illustrated by the following examples which 
should not be construed as constituting a limitation thereto. 

IV.A. Synthesis of Intermediate 1 




Formation of Intermediate 1 follows the procedure described by Stout 
(Stout , (1983) J. Med. Chem. 26(6): 808-13). To 4-methoxybenzyl amine 
(25g, 0.18 mol; Aldrich) is added 46% HBr in H 2 0 (106ml, 0.9 mol; Aldrich). 
The reaction is refluxed overnight, then the reaction is cooled to 0°C and 
neutralized to pH7 slowly with KOH (s) . The reaction is allowed to stir for -30 
min, then the solid filtered and dried. The solid is redisolved in hot MeOH, 
filtered and the solution cooled to afford 19g (85%) intermediate 1. 



IV.B. Synthesis of Intermediate 2 




A solution of ethyl 2-chloroacetoacetate (35.3g, 29.7mL, 0.21 mol) and 
4-(trifluoromethyl)thiobenzamide (44g, 0.21 mol) in EtOH (300mL) is refluxed 
overnight. After cooling to room temperature the solvent is removed in vacuo. 
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The final product (Intermediate 2) is recrystallized from a minimum of MeOH 
to afford 40g (59%) of final product as a white solid. 

IV.C. Synthesis of Inter mediate 3 



To Intermediate 2 (1.84g, 5.8 mmol) in THF is added 1N LiOH (6mL, 6 
mmol) and the reaction stirred at room temperature. After ~3h, the reaction is 
neutralized with 1N HCI, extracted 3 x 100 mL EtOAc, dried over Na 2 S0 4 , 
filtered and the solvent removed under vaccum to afford 1.5g (89%) of 
10 Intermediate 3 as a white solid. 



15 (565mg, 4.2 mmol; Aldrich), EDC (800mg, 4.2 mmol; Aldrich) and 
Intermediate 1 (860mg, 7 mmol). The reaction is stirred at room temperature 
for 18h. The solvent is then removed in vacuo, treated with H2O and extracted 
3x 100mL CH2CI2. The organic phases are then combined and washed with 
1N HCI, dried over Na 2 S0 4 , filtered and evaporated to afford a mixture (/V- 

20 substituted and A/,0-substituted). The mixture is disolved in MeOH and 
treated with 1N NaOH. The reaction is stirred 18h at 50°C. The solvent 
removed in vacuo, dissolved in CH 2 CI 2l washed with H 2 0, and dried over 
Na 2 S0 4 . The solvent evaporated the residue chromatographed 
(CH 2 CI 2 /MeOH: 99/1) to afford 610mg (47%) of Intermediate 4 as a white 

25 solid. 



5 




IV.D. Synthesis of Intermediate 4 




To intermediate 3 (1g, 7 mmol) in CH 2 CI 2 /DMF (1:1) is added HOBT 
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IV.E. Synthesis of Intermediate 5 

O 




2-methyl-2-[4-{[(4-methyl-2-[4-trifluoromethylpheny^ 
ylcarbonyl)amino]methyl}phenoxy]propionic acid ethyl ester 
To Intermediate 4 (710mg, 181 mmol) in DMF (50mL) is added the 
K 2 C0 3 (275mg, 1.99 mmol) followed by the ethyl 2-bromo-2-methylpropanate 
(280|jL f 1 .91 mmol; Aldrich) and the reaction is heated to 80°C. After 18h, the 
reaction is cooled to room temperature and the solvent removed in vacuo. 
The residue is treated with water (200 mL), extracted 3 x 50mL CH 2 CI 2 , dried 
over Na2S0 4 , filtered and the solvent removed under vaccum to afford 680mg 
(77%) of Intermediate 5 as a clear oil. 

IV.F. Synthesis of Compound 1 



O 




2-methyl-2-[4-{[(4-methyl-2-[4-trifluoromethylphenylHhiazol-5- 
ylcarbonyl)amino]methyl}phenoxy]propionic acid 
To Intermediate 5 (680mg, 1.39 mmol) in MeOH is added 1N NaOH 
(1.6 mL, 1 .6 mmol) and the reaction is stirred at 60°C. After 18h, the reaction 
is cooled to room temperature and the solvent evaporated. The residue 
treated with 1N HCI, extracted 3 x 20 mL THF and the solvent is removed 
under vacuum to afford 500mg (75%) of Compound 1. 
V. Production of PPAR Polypeptides 
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The native and mutated PPAR polypeptides, and fragments thereof, of 
the present invention can be chemically synthesized in whole or part using 
techniques that are well-known in the art ( See, e.g. , Creighton , (1983) 
Proteins: Structures and Molecular Principles , W.H. Freeman & Co., New 
5 York, incorporated herein in its entirety). Alternatively, methods which are 
well known to those skilled in the art can be used to construct expression 
vectors containing a partial or the entire native or mutated PPAR polypeptide 
coding sequence and appropriate transcriptional/translational control signals. 
These methods include in vitro recombinant DNA techniques, synthetic 

10 techniques and in vivo recombination/genetic recombination. See, for 
example, the techniques described throughout Sambrook et al. , (1989) 
Molecular Cloning: A Laboratory Manual , Cold Spring Harbor Laboratory, New 
York, and Ausubel et al. , (1989) Current Protocols in Molecular Biology , 
Greene Publishing Associates and Wiley Interscience, New York, both 

1 5 incorporated herein in their entirety. 

A variety of host-expression vector systems can be utilized to express 
a PPAR coding sequence. These include but are not limited to 
microorganisms such as bacteria transformed with recombinant 
bacteriophage DNA, plasmid DNA or cosmid DNA expression vectors 

20 containing a PPAR coding sequence; yeast transformed with recombinant 
yeast expression vectors containing a PPAR coding sequence; insect cell 
systems infected with recombinant virus expression vectors (e.g., baculovirus) 
containing a PPAR coding sequence; plant cell systems infected with 
recombinant virus expression vectors (e.g., cauliflower mosaic virus, CaMV; 

25 tobacco mosaic virus, TMV) or transformed with recombinant plasmid 
expression vectors (e.g., Ti plasmid) containing a PPAR coding sequence; or 
animal cell systems. The expression elements of these systems vary in their 
strength and specificities. 

Depending on the host/vector system utilized, any of a number of 

30 suitable transcription and translation elements, including constitutive and 
inducible promoters, can be used in the expression vector. For example, 
when cloning in bacterial systems, inducible promoters such as pL of 
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bacteriophage X, plac, ptrp, ptac (ptrp-lac hybrid promoter) and the like can be 
used. When cloning in insect cell systems, promoters such as the baculovirus 
polyhedrin promoter can be used. When cloning in plant cell systems, 
promoters derived from the genome of plant cells, such as heat shock 
5 promoters; the promoter for the small subunit of RUBISCO; the promoter for 
the chlorophyll a/b binding protein) or from plant viruses (e.g., the 35S RNA 
promoter of CaMV; the coat protein promoter of TMV) can be used. When 
cloning in mammalian cell systems, promoters derived from the genome of 
mammalian cells (e.g., metallothionein promoter) or from mammalian viruses 
10 (e.g., the adenovirus late promoter; the vaccinia virus 7.5K promoter) can be 
used. When generating cell lines that contain multiple copies of the tyrosine 
kinase domain DNA, SV40-, BPV- and EBV-based vectors can be used with 
an appropriate selectable marker. 



15 VL Formation of PPARa Ligand Binding Domain Crystals 

In one embodiment, the present invention provides crystals of 
PPARa LBD. The crystals were obtained using the methodology disclosed in 
the Examples. The PPARa LBD crystals, which can be native crystals, 
derivative crystals or co-crystals, have orthorhombic unit cells (an 

20 orthorhombic unit cell is a unit cell wherein a * b * c, and wherein a = (J = y = 
90°) and space group symmetry P2i2i2. There is one PPARa LBD molecule 
in the asymmetric unit. In this PPARa crystalline form, the unit cell has 
dimensions of a= 61.3 A, b= 103.5 A, c= 49.9 A, and a = p = y = 90°. This 
crystal form can be formed in a crystallization reservoir comprising 4-8% PEG 

25 3350, 1 00-200mM NaF, and 1 2-1 6% 2,5 hexanediol. 

In another PPARa crystal form, the unit cell has dimensions of a = 
95.58 A, b = 122.06 A, c = 122.10 A, a = p = y = 90° and belongs to the space 
group P2i2i2!. There are four PPARa LBD molecules in the asymmetric unit 
of this crystal form. This crystal form can be formed in a crystallization 

30 reservoir comprising 50 mM bis-tris-propane, 4-6% PEG 3350, 150 mM 
NaN0 3 , 16% 2,5 hexanediol, and 1-3 mM YCI 3 . 
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VIA Preparation of PPAR Crystals 

The native and derivative co-crystals, and fragments thereof, disclosed 
in the present invention can be obtained by a variety of techniques, including 
5 batch, liquid bridge, dialysis, vapor diffusion and hanging drop methods ( See, 
e.g. , McPherson , (1982) Preparation and Analysis of Protein Crystals , John 
Wiley, New York.; McPherson , (1990) Eur. J. Biochem. 189:1-23.; Weber , 
(1991) Adv. Protein Chem. 41:1-36). In a preferred embodiment, the vapor 
diffusion and hanging drop methods are used for the crystallization of PPAR 

1 0 polypeptides and fragments thereof. 

In general, native crystals of the present invention are grown by 
dissolving substantially pure PPAR polypeptide or a fragment thereof in an 
aqueous buffer containing a precipitant at a concentration just below that 
necessary to precipitate the protein. Water is removed by controlled 

15 evaporation to produce precipitating conditions, which are maintained until 
crystal growth ceases. 

In a preferred embodiment of the invention, native crystals are grown 
by vapor diffusion (See, e.g. , McPherson , (1982) Preparation and Analysis of 
Protein Crystals , John Wiley, New York.; McPherson , (1990) Eur. J. Biochem. 

20 189:1-23). In this method, the polypeptide/precipitant solution is allowed to 
equilibrate in a closed container with a larger aqueous reservoir having a 
precipitant concentration optimal for producing crystals. Generally, less than 
about 25 |oL of PPAR polypeptide solution is mixed with an equal volume of 
reservoir solution, giving a precipitant concentration about half that required 

25 for crystallization. This solution is suspended as a droplet underneath a 
coverslip, which is sealed onto the top of the reservoir. The sealed container 
is allowed to stand, until crystals grow. Crystals generally form within two to 
six weeks, and are suitable for data collection within approximately seven to 
ten weeks. Of course, those of skill in the art will recognize that the above- 

30 described crystallization procedures and conditions can be varied. 



VLB. Preparation of Derivative Crystals 
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Derivative crystals of the present invention, e.g. heavy atom derivative 
crystals, can be obtained by soaking native crystals in mother liquor 
containing salts of heavy metal atoms. Such derivative crystals are useful for 
phase analysis in the solution of crystals of the present invention. In a 
5 preferred embodiment of the present invention, for example, soaking a native 
crystal in a solution containing methyl-mercury chloride provides derivative 
crystals suitable for use as isomorphous replacements in determining the X- 
ray crystal structure of a PPAR polypeptide. Additional reagents useful for the 
preparation of the derivative crystals of the present invention will be apparent 
10 to those of skill in the art after review of the disclosure of the present invention 
presented herein. 

«■ 

VI.C Preparation of Co-crystals 

Co-crystals of the present invention can be obtained by soaking a 
15 native crystal in mother liquor containing compounds known or predicted to 
. bind the LBD of a PPAR, or a fragment thereof. Alternatively, co-crystals can 
be obtained by co-crystallizing a PPAR LBD polypeptide or a fragment thereof 
in the presence of one or more compounds known or predicted to bind the 
polypeptide. In a preferred embodiment, such a compound is Compound 1. 

20 

Vl.D. Solving a Crystal Structure of the Present Invention 
Crystal structures of the present invention can be solved using a 
variety of techniques including, but not limited to, isomorphous replacement, 
anomalous scattering or molecular replacement methods. Computer software 

25 packages will also be helpful in solving a crystal structure of the present 
invention. Applicable software packages include but are not limited to X- 
PLOR™ program (Brunger , (1992) X-PLOR, Version 3.1 A System for X-ray 
Crystallography and NMR, Yale University Press, New Haven, Connecticut; 
X-PLOR is available from Molecular Simulations, Inc., San Diego, California), 

30 Xtal View (McRee , (1992) J. Mol. Graphics 10: 44-47; X-tal View is available 
from the San Diego Supercomputer Center). SHELXS 97 (Sheldrick (1990) 
Acta Cryst. A46: 467; SHELX 97 is available from the Institute of Inorganic 
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Chemistry, Georg-August-Universitat, Gdttingen, Germany), HEAVY 
(Terwilliger, Los Alamos National Laboratory) and SHAKE-AND-BAKE 
(Hauptman , (1997) Curr. Opin. Struct. Biol. 7: 672-80; Weeks et al. , (1993) 
Acta Cryst. D49: 179; available from the Hauptman-Woodward Medical 
5 Research Institute, Buffalo, New York) can be used. See also , Ducruix & 
Geige, (1992) Crystallization of Nucleic Acids and Proteins: A Practical 
Approach , IRL Press, Oxford, England, and references cited therein. 

VII. Characterization and Solution of a PPARa Liqand Binding Domain 
10 Crystal 

VI l. A Unique Structural Differences Between PPARa and Other 
PPARs 

The PPARa LBD-ligand structure was solved here using molecular 

15 replacement techniques. The overall folding of the protein backbone, and the 
binding mode of Compound 1 , are shown in the ribbon diagram of Figure 1 . 
Specific interactions between Compound 1 and the protein are shown 
schematically in Figure 2. The structure of the PPARy LBD was solved 
previously in the apo form, i.e., with no ligand, and also with a 

20 thiazolidinedione ligand (rosiglitazone) and a coactivator peptide ( Nolte et al. , 
(1998) Nature 395:137-43). The apo structure has also been determined 
independently ( Uppenberg et al. , (1998) J. Biol. Chert). 273: 31108-12). In 
addition, the structure of the PPARy LBD has been determined with a partial 
agonist, GW0072 (Oberfield et al. , (1999) Proc. Nat. Acad. Sci. 96: 6102-106). 

25 The structure of the PPARy LBD has also been determined in the 
heterodimeric complex with RXRa, together with coactivator peptides, with 
rosiglitazone, and also with a carboxylic acid ligand, GI262570 (Gampe et al. t 
(2000) Mol. Cell 5: 545-55). The structure of the PPAR8 LBD has been 
determined in the apo form, and with eicosapentaenoic acid, and with the 

30 . synthetic compound GW2433 (Xu et al. , (1999) Mot. Cell 3: 397-403). 
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To facilitate comparison, the structures were first translated and rotated 
into a common position and orientation as shown in Figure 1. This 
superimposition operation was done with the MVP program ( Lambert , (1997) 
in Practical Application of Computer-Aided Drug Design , (Charifson, ed.), pp. 
5 243-303, Marcel-Dekker, New York) by first aligning the amino acid 
sequences to identify corresponding residues in the three different PPARs, 
and then rotating and translating so as to superimpose corresponding C a or 
backbone atoms from the aligned residues. With this translation and rotation, 
most of the major alpha-helices and beta-strands of the three PPARs are 

10 closely superimposed. The AF2 helix is well superimposed in most of the 
structures, but is shifted into a different position in certain subunits of 
structures of PPARy either in the absence of ligand (apo), or in the presence 
of the partial agonist GW0072. This shift provides evidence that the PPAR 
AF2 helix can shift out of the active conformation, and that it is more likely to 

15 do so in the absence of a strongly activating ligand. In the above listed PPAR 
X-ray structures, the ligands that act as strong agonists (rosiglitazone, 
GI262570, eicosapentaenoic acid, GW2433, Compound 1) are all oriented 
with the acid group near the AF2 helix, and they all make a hydrogen bond 
with a tyrosine residue in the AF2 helix (Tyr464 in PPARa, Tyr473 in PPARy, 

20 Tyr437 in PPARS). The partial agonist, GW0072, also has a carboxyl group, 
but it is oriented differently, and fails to make any hydrogen bond with the AF2 
tyrosine. This suggests that the hydrogen bond between the ligand and the 
AF2 tyrosine helps to hold the AF2 helix in the active conformation and 
thereby facilitate coactivator binding. See, Xu et al. , (1999) Mol. Cell 3: 397- 

25 403 and Oberfield et al. , (1999) Proc. Nat Acad. Sci. 96: 6102-106. 

To facilitate discussion, it is useful to establish a nomenclature for 
regions of the ligand and the ligand binding pocket. The strongly activating 
ligands in these X-ray structures all have an acid group that binds near the 
AF2 tyrosine, and a lipophilic "tail" that extends into a lipophilic pocket. The 

30 acid group can be called the acid "headgroup", and the site near the AF2 helix 
into which it binds can be called the "headgroup binding site". In rosiglitazone, 
GI262570 and Compound 1, the lipophilic tail is directed into a lipophilic 
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pocket delineated by helix-2\ helix-3 and the beta sheet. This will be called 
the "lower tail pocket" or "lower pocket". In the structure of PPAR5 bound to 
eicosapentaenoic acid, the eicosapentaenoic acid was found in two different 
binding modes with roughly equal occupancy (Xu et a). , (1999) Mol. Cell 3: 
5 397-403). In both binding modes, the acid headgroup bound in the 
headgroup binding site near Tyr 437. In one binding mode, the lipophilic tail 
was directed into the lower tail pocket. However, in the other binding mode, 
the lipophilic tail was directed into a different pocket, delineated by helix-3, 
helix-5, the beta sheet and the loop between helix-1 and helix-2. This will be 

10 called the "upper tail pocket." GW2433 has a branched tail, and was found to 
bind with one branch in the lower pocket, and with the other branch in the 
upper pocket (Xu et al. , (1999) Mol. Cell 3: 397-403). GI262570 has an 
additional lipophilic benzophenone group near the acid headgroup. This 
benzophenone group was found to be directed into a lipophilic pocket 

15 delineated by helix-3, helix-7, helix-1 0 and the loop between helix-1 0 and the 
AF2 helix. This will be subsequently referred to as the "benzophenone 
pocket." 

Comparing the backbone structures, there are substantial differences 
between PPARcc, PPARy and PPAR5 in the "loop" region between helix-2 and 

20 strand-1, the loop region between helix-2' and helix-3, and the loop region 
between helix-1 0 and the AF2 helix. There are also smaller differences in the 
loop region between helix-1 and helix-2, and the loop region between helix-6 
and helix-7. Helices 2, 2' and 3' are closely superimposed, but have slightly 
different lengths, as indicated in Table 4. Helix-1 0 has the same length in all 

25 structures, but bends differently over the ligand binding pocket in PPARoc, 
PPARy and PPAR8. Some of these structural differences are visible in Figure 
3. Many of these structural differences lie close to the ligand binding pocket, 
and could therefore be important in the receptor selectivity of different ligands. 
Helix-2' and the loop between helix-2' and helix-3 (the 2-3 loop) 

30 together serve as one wall for the "lower tail pocket" in all three PPARs. 
Helix-2', and the 2-3 loop, adopt different conformations in some of the 
different PPARy structures. In particular, helix-2' is shorter in the homodimer 
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structures of PPARy bound to rosiglitazone and GW0072, and in the absence 
of ligand, than it is in the RXRo/PPARy heterodimer structures where 
PPARy is bound to rosiglitazone or GI262570. Some of these conformational 
differences might result from differences in crystal packing. However, the 
5 RXRa/PPARy heterodimer represents the closer approximation to the 
biologically active complex, and will be used here to represent the most 
relevant conformation of PPARy. By contrast, helix-2' and the 2'-3 loop adopt 
more nearly equivalent backbone conformations in the available X-ray 
structures of PPARa and PPARS. Considering the conformation in the 

10 RXRo/PPARy/GI262570 heterodimer structure, helix-2' is longer in PPARa 
and PPARS, and the C-terminal end of the helix adopts different 
conformations in the three . PPARs. In PPARa, the last residue of the helix, 
Leu254, makes the expected alpha-helical hydrogen bonds, but bulges away 
from the axis of the helix towards the ligand. This bulged conformation allows 

15 Leu254 and Val255 to cover the "bottom" of the lower tail pocket, effectively 
narrowing the mouth of the pocket. In PPAR5, the C-terminal end of helix-2' 
is wound more loosely, such that the corresponding residue, Leu226, cannot 
cover the bottom of the ligand binding pocket. Instead, the 2'-3 loop adopts a 
different conformation that places the side-chain of Trp228 in the bottom of 

20 the lower tail pocket. This shortens the lower tail pocket slightly, and also 
constricts the opening to solvent. In PPARy, helix-2 1 is bent such that its 
corresponding residue, Ile262, is shifted farther from the pocket. Also, the 
next two residues, Lys263 and Phe264, adopt the helical conformation, 
effectively extending helix-2 by two residues, such that Lys263 is far from the 

25 ligand and cannot cover the solvent channel. This leaves a very wide 
opening to solvent at this position in PPARy, as depicted by the large, wide 
cavity region above the 2'-3 loop in Figure 5. The opening is much narrower 
in PPARa and PPARS, as depicted in Figures 4 and 6. These variations in 
the backbone conformation would be difficult or impossible to predict from the 

30 previously available X-ray structures with any homology modeling procedure. 
Nonetheless, very accurate models for helix-2' and the 2'-3 loop in all three 
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PPARs would be essential for understanding the binding mode and receptor 
selectivity of ligands that occupy the lower tail pocket. 

The loop between helix-1 and helix-2 (the "1-2 loop") serves as a wall 
at the far end of the upper tail pocket. Residues in this 1-2 loop come close to 
5 the tail of eicosapentaenoic acid (in its upper tail pocket binding mode), and 
close to the fluoro-,chloro-phenyl ring of GW2433 in PPAR5. Several 
backbone amide CO and NH groups are directed towards the ligand binding 
site, and could serve as hydrogen bonding partners with an appropriately 
designed ligand. However, the backbone conformation in the 1-2 loop is 

10 slightly different in PPARa, PPARy and PPAR5. In particular, the backbone 
amide CO and NH groups have slightly different orientations in PPARa, 
PPARy and PPAR8. Some of these differences may result from the presence 
of a proline at position 227 in PPARy, whereas PPARa and PPAR8 have 
asparagines at this position. These subtle differences would be difficult or 

15 impossible to predict to high accuracy using the previously available X-ray 
structures with standard homology modeling procedures. Nonetheless, a 
very accurate model of this loop conformation would be essential for 
understanding the receptor selectivity of ligands that interact with this loop. 

The C-terminal end of helix-1 0 and the loop between helix-1 0 and the 

20 AF2 helix (the "10-AF2 loop") together serve as one wall for the headgroup 
binding site and the benzophenone pocket. An additional wall is provided by 
a glutamine residue in helix-3 (Gln277 in PPARa, Gln286 in PPARy, Gln250 
in PPAR5) that reaches across the benzophenone binding site to make 
hydrogen bonds with backbone CO and NH groups from the 10-AF2 loop. 

25 This glutamine side-chain, and the 10-AF2 loop itself, adopt different 
conformations in the three PPARs. In PPARa and PPAR8, the glutamine 
side-chain adopts conformations that narrow the benzophenone pocket 
substantially. In addition, in PPARa and PPAR8, the 10-AF2 loop adopts a 
conformation that crowds a phenylalanine from helix-3 (Phe273 in PPARa, 

30 Phe282 in PPARy, Phe246 in PPAR8) more in PPARa and PPARS than in 
PPARy. These two structural differences cause a slight narrowing of the 
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benzophenone pocket in PPARct, and a more substantial narrowing in 
PPARS. This can be seen by comparison of PPARct and PPAR8 in Figures 4 
and 6 with PPARy in Figure 5. The narrowing is sufficient to significantly 
reduce the binding affinity of ligands that have a benzophenone group at this 
5 position. The narrowing in PPARa and PPAR5 suggests that smaller groups 
should be used at this position to obtain good binding to PPARa, and 
substantially smaller groups should be used to obtain good binding to PPARS. 
Aside from these changes that narrow the pocket in PPARa and PPARS, 
there are also structural changes that modulate the shape of the pocket and 

10 the position and orientation of potential hydrogen bonding groups. For 
example, the C-terminal end of helix-10 bends inwards toward the ligand 
binding site in all three PPARs. However, the bend is slightly different in the 
different PPARs, placing corresponding side-chains in slightly different 
positions. These variations in the backbone conformation, and corresponding 

15 changes in side-chain conformation, would be difficult or impossible to predict 
from the previously available X-ray structures with any available homology 
modeling procedure. Nonetheless, very accurate models for helix-10, the 10- 
AF2 loop and the side-chains in this region would be essential for 
understanding the binding mode and receptor selectivity of ligands that 

20 occupy the benzophenone pocket. 

Aside from these unexpected differences in the backbone structure of 
PPARa, the present X-ray structure also revealed differences involving side- 
chains. In some cases, these differences involve residue positions where the 
amino acid is different in PPARa from that in either PPARy or PPARS or both. 

25 In other cases, PPARa may have the same amino acid at a particular residue 
position as PPARy or PPAR8 or both, but the amino acid side-chain adopts a 
different conformation in PPARa. 

One of the most important side-chain differences is Tyr314 in PPARa, 
a position that corresponds to His323 in PPARy and His287 in PPARS. In 

30 PPARy and PPARS, this histidine residue makes a hydrogen bond with the 
acidic headgroup of the ligand for all strongly activating ligands for which 
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staictures are available. The present PPARa structures shows that Tyr314 
also makes a hydrogen bond with the acidic headgroup of Compound 1. 
However, the tyrosine OH lies farther from the protein backbone than the 
corresponding hydrogen bonding atoms in histidine. Consequently, changing 
5 histidine to tyrosine could potentially require a change in the position and/or 
orientation and/or conformation of the ligand, or changes in the conformation 
of the PPARa protein. The present PPARa crystal structure showed that the 
protein backbone is essentially unchanged in this region of PPARa, compared 
with PPARy and PPAR8. Instead, the acid headgroup of the ligand is shifted 

10 and rotated in such a way that it can still make hydrogen bonds with Ser280, 
Tyr314, His440 and Tyr464. This involves small changes in the 
conformations of Tyr314, His440 and Tyr464, relative to PPARy and PPAR8. 
Also, this involves a larger change in the conformation of Ser280, which 
unexpectedly adopts a conformation different from that of Ser289 in 

15 PPARy bound to GI262570. The PPARa conformation places the Ser280 
side-chain oxygen near the position of the side-chain oxygen of Thr253 in 
PPAR8 bound to GW2433. The exact shift and rotation, and the exact 
conformations of the side-chains of Ser280, Tyr314, His440 and Tyr464, 
would be difficult or impossible to predict without this X-ray crystal structure of 

20 PPARa. However, this shift and rotation do significantly affect the position of 
the whole ligand within the ligand binding site. The position and orientation of 
the ligand carboxylate group revealed in this PPARa X-ray structure, and the 
interactions it makes with Ser280, Tyr314, His440 and Tyr464, can serve as a 
template for docking other compounds into PPARa using molecular modeling 

25 procedures. 

This PPARa X-ray structure also revealed numerous other side-chains 
where the conformation in PPARa was different from that in PPARy and/or 
PPAR8, and where the diffence could affect the shape of the ligand binding 
pocket, and the position, orientation and/or conformation of the ligand. These 
30 additional residues with differences include, but are not limited to, GIn277, 
Phe273, His274, Ile354, Leu321, Met320, Met330 and Glu251. The side- 
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chain conformational differences involving Gln277, Phe273 and Ile354 affect 
the volume and detailed shape of the headgroup binding site and/or the 
benzophenone pocket. His274 may affect these pockets indirectly, through its 
effect on Gln277 and Phe273. Leu321, Met320, Met330 and Glu251 affect 
5 the shape and volume of the upper and lower tail pockets. Numerous other 
side-chains also affect the size, shape and electrostatic character of the 
ligand binding site, and the position, orientation and conformation of the ligand 
within the ligand binding site. An understanding of the PPARa, PPARy and 
PPAR6 selectivity of various ligands would depend on having an accurate 
10 structure of each of the three PPARs, as well as an accurate position of the 
whole ligand within the pocket. This PPARa X-ray structure provides an 
accurate protein structure, as well as a template for modeling alternative 
ligands. 

15 VH.B Characterization of the PPARg Binding Pocket 

The ligand binding domain of PPARa was co-crystallized with 
Compound 1, which has the IUPAC name 2-methyl-2-[4-{[(4-methyl-2-[4- 
trifluoromethylphenyl] thiazol-5-yl-carbonyl) amino] methyl} phenoxy] propionic 
acid. 



Compound 1 is an agonist of hPPARoc and is useful for treatment of 
hPPARa mediated diseases or conditions including dyslipidemia, syndrome 
25 X, heart failure, hypercholesteremia, cardiovascular disease, type II diabetes 



20 




Compound 1 
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mellitus, type I diabetes, insulin resistance, hyperlipidemia, obesity, 
inflammation, anorexia bulimia and anorexia nervosa. 

Figures 1 , 2 and 7 depict the conformation and orientation of the ligand 
Compound 1 in the binding site. Figure 1 depicts the overall orientation of the 
5 a helices and p strands of the ligand binding domain of PPARa as it binds 
Compound 1. Compound 1 is presented as a spacefilling model. 

A more specific graphical description of the residues of the PPARa 
LBD involved in ligand binding is presented in Figure 2. Figure 2 is a 
schematic diagram depicting those residues of the PPARa LBD that interact 
10 with Compound 1 as it is bound in the binding pocket of the PPARa LBD. 
Note that Figure 2 is a schematic diagram, and highlights residues that 
interact with the ligand and does not indicate intermolecular distances. 

V1LC. Hydrogen Bonding in the Binding Pocket of the PPARa LBD 
15 The hydrogen bonding scheme of the solvated binding pocket is 

presented in Figure 7. In Figure 7, atoms are shaded according to element, 
with carbon, fluorine, nitrogen, oxygen, sulfur and hydrogen in progressively 
lighter shades of gray. Sulfur is depicted with a slightly bigger ball, while 
hydrogen, fluorine and oxygen are depicted with slightly smaller balls. The 
20 hydrogen atoms shown here were not visible in the electron density, and were 
instead modeled into reasonable conformations (using standard bond lenghts 
and angles) to obtain possible hydrogen bond interactions, shown here with 
strings of small white balls. Each interaction is annotated with its distance in 
angstroms. Residues of the PPARa LBD binding pocket involved in hydrogen 
25 bonding are labeled. It is observed that hydrogen bonding relationships exist 
between the side chains of PPARa LBD residues and the ligand, between the 
side chains of PPARa LBD residues and solvent molecules, and between the 
ligand and solvent molecules. The hydrogen bonding pairs are identified by 
dotted lines. 

30 Figure 7 highlights hydrogen bonding that occurs between solvent 

molecules (indicated as crosses in Figure 7) and the side chains of PPARa 
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LBD residues in the binding pocket. Residues that hydrogen bond to solvent 
and are visible in Figure 7 include Ser-280, Ser-283, Thr-279 and Ala-333. 

Additionally, Figure 7 highlights those residues of the PPARa LBD that 
hydrogen bond directly to the ligand. Residues of the PPARa LBD in which 
5 side chains hydrogen bond directly to the ligand are visible in Figure 7, and 
include His-440, Tyr-464, Tyr-314, Ser280 and Thr-279. 

Vll-D. Generation of Easily-Solved PPAR Crystals 

The present invention discloses a substantially pure PPAR LBD 

1 0 polypeptide in crystalline form. In a preferred embodiment, exemplified in the 
Figures and Laboratory Examples, PPARa is crystallized with bound ligand. 
Crystals are formed from PPAR LBD polypeptides that are usually expressed 
by a cell culture, such as E. coli. Bromo-, iodo- and substitutions can be 
included during the preparation of crystal forms and can act as heavy atom 

15 substitutions in PPAR ligands and crystals of PPARs. This method can be 
advantageous for the phasing of the crystal, which is a crucial, and sometimes 
limiting, step in solving the three-dimensional structure of a crystallized entity. 
Thus, the need for generating the heavy metal derivatives traditionally 
employed in crystallography can be eliminated. After the three-dimensional 

20 structure of a PPAR or PPAR LBD with or without a ligand bound is 
determined, the resultant three-dimensional structure can be used in 
computational methods to design synthetic ligands for PPARa and other 
PPAR polypeptides. Further activity structure relationships can be 
determined through routine testing, using assays disclosed herein and known 

25 in the art. 

VIII. Uses of PPARa Crystals and the Three-Dimensional Structure of the 
Ligand Binding Domain of PPARa 

30 VIII.A. Design and Development of PPAR Modulators 

The knowledge of the structure of the PPARa ligand binding domain 
(LBD), an aspect of the present invention, provides a tool for investigating the 
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mechanism of action of PPARa and other PPAR polypeptides in a subject. 
For example, various computer modelleing programs, as described herein, 
can predict the binding of various ligand molecules to the LBD of PPARa, 
PPARy or PPAR8. Upon discovering that such binding in fact takes place, 
5 knowledge of the protein structure then allows design and synthesis of small 
molecules that mimic the functional binding of the ligand to the LBD of 
PPARa, and to the LBDs of other PPAR polypeptides. This is the method of 
"rational" drug design, further described herein. 

Use of the isolated and purified PPARa crystalline structure of the 
10 present invention in rational drug design is thus provided in accordance with 
the present invention. Additional rational drug design techniques are 
described in U.S. Patent Nos. 5,834,228 and 5,872,011, incorporated herein 
in their entirety. 

Thus, in addition to the compounds described herein, other sterically 
15 similar compounds can be formulated to interact with the key structural 
regions of a PPAR in general, or of PPARa in particular. The generation of a 
structural functional equivalent can be achieved by the techniques of 
modeling and chemical design known to those of skill in the art and described 
herein. It will be understood that all such sterically similar constructs fall 
20 within the scope of the present invention. 

VIU.A.1. Rational Drug Design 

The three-dimensional structure of ligand-binding PPARa is 
unprecedented and will greatly aid in the development of new synthetic 

25 ligands for a PPAR polypeptide, such as PPAR agonists and antagonists, 
including those that bind exclusively to any one of the PPAR subtypes. In 
addition, the PPARs are well suited to modern methods, including three- 
dimensional structure elucidation and combinatorial chemistry, such as those 
disclosed in U.S. Patent No. 5,463,564, incorporated herein by reference. 

30 Structure determination using X-ray crystallography is possible because of the 
solubility properties of the PPARs. Computer programs that use 
crystallography data when practicing the present invention will enable the 
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rational design of ligands to these receptors. Programs such as RASMOL 
(Biomolecular Structures Group, Glaxo Wellcome Research & Development 
Stevenage, Hertfordshire, UK Version 2.6, August 1995, Version 2.6.4, 
December 1998, Copyright © Roger Sayle 1992-1999) can be used with the 
5 atomic structural coordinates from crystals generated by practicing the 
invention or used to practice the invention by generating three-dimensional 
models and/or determining the structures involved in ligand binding. 
Computer programs such as those sold under the registered trademark 
INSIGHT II® and such as GRASP (Nicholls et al. , (1991) Proteins 11: 282) 
10 allow for further manipulations and the ability to introduce new structures. In 
addition, high throughput binding and bioactivity assays can be devised using 
purified recombinant protein and modem reporter gene transcription assays 
known to those of skill in the art in order to refine the activity of a designed 
ligand. 

15 A method of identifying modulators of the activity of a PPAR 

polypeptide using rational drug design is thus provided in accordance with the 
present invention. The method comprises designing a potential modulator for 
a PPAR polypeptide of the present invention that will form non-covalent 
interactions with amino acids in the ligand binding pocket based upon the 

20 crystalline structure of the PPARa LBD polypeptide; synthesizing the 
modulator; and determining whether the potential modulator modulates the 
activity of the PPAR polypeptide. In a preferred embodiment, the modulator is 
designed for a PPARa polypeptide. Preferably, the PPARa polypeptide 
comprises the nucleic acid sequence of SEQ ID NO:1, and the PPARa LBD 

25 comprises the nucleic acid sequence SEQ ID NO:3. The determination of 
whether the modulator modulates the biological activity of a PPAR 
polypeptide is made in accordance with the screening methods disclosed 
herein, or by other screening methods known to those of skill in the art. 
Modulators can be synthesized using techniques known to those of ordinary 

30 skill in the art. 

In an alternative embodiment, a method of designing a modulator of a 
PPAR polypeptide in accordance with the present invention is disclosed 
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comprising: (a) selecting a candidate PPAR ligand; (b) determining which 
amino acid or amino acids of an PPAR polypeptide interact with the ligand 
using a three-dimensional model of a crystallized PPARa LBD; (c) identifying 
in a biological assay for PPAR activity a degree to which the ligand modulates 
5 the activity of the PPAR polypeptide; (d) selecting a chemical modification of 
the ligand wherein the interaction between the amino acids of the PPAR 
polypeptide and the ligand is predicted, to be modulated by the chemical 
modification; (e) performing the chemical modification on the ligand to form a 
modified ligand; (f) contacting the modified ligand with the PPAR polypeptide; 

10 (g) identifying in a biological assay for PPAR activity a degree to which the 
modified ligand modulates the biological activity of the PPAR polypeptide ; 
and (h) comparing the biological activity of the PPAR polypeptide in the 
presence of modified ligand with the biological activity of the PPAR 
polypeptide in the presence of the unmodified ligand, whereby a modulator of 

15 an PPAR polypeptide is designed. 

VI1IA2. Methods for Using the PPARa LBD Structural 

Coordinates For Molecular Design 
For the first time, the present invention permits the use of molecular 
20 design techniques to design, select and synthesize chemical entities and 
compounds, including modulatory compounds, capable of binding to the 
ligand binding pocket or an accessory binding site of PPARa and the PPARa 
LBD, in whole or in part. Correspondingly, the present invention also provides 
for the application of similar techniques in the design of modulators of any 
25 PPAR polypeptide. 

In accordance with a preferred embodiment of the present invention, 
the structure coordinates of a crystalline PPARa LBD can be used to design 
compounds that bind to a PPAR LBD (more preferably a PPARa LBD) and 
alter the properties of a PPAR LBD (for example, the dimerization or ligand 
30 binding ability) in different ways. One aspect of the present invention provides 
for the design of compounds that can compete with natural or engineered 
ligands of a PPAR polypeptide by binding to all, or a portion of, the binding 
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sites on a PPAR LBD. The present invention also provides for the design of 
compounds that can bind to all, or a portion of, an accessory binding site on a 
PPAR that is already binding a ligand. Similarly, non-competitive 
agonists/iigands that bind to and modulate PPAR LBD activity, whether or not 
5 it is bound to another chemical entity, can be designed using the PPAR LBD 
structure coordinates of this invention. 

A second design approach is to probe a PPAR or PPAR LBD 
(preferably a PPARa or PPARa LBD) crystal with molecules comprising a 
variety of different chemical entities to determine optimal sites for interaction 

10 between candidate PPAR or PPAR LBD modulators and the polypeptide. For 
example, high resolution X-ray diffraction data collected from crystals 
saturated with solvent allows the determination of the site where each type of 
solvent molecule adheres. Small molecules that bind tightly to those sites can 
then be designed and synthesized and tested for their PPARa modulator 

15 activity. Representative designs are also disclosed in published PCT 
application WO 99/26966. 

Once a computationally-designed ligand is synthesized using the 
methods of the present invention or other methods known to those of skill in 
the art, assays can be used to establish its efficacy of the ligand as a 

20 modulator of PPAR (preferably PPARa) activity. After such assays, the 
ligands can be further refined by generating intact PPAR, or PPAR LBD, 
crystals with a ligand bound to the LBD. The structure of the ligand can then 
be further refined using the chemical modification methods described herein 
and known to those of skill in the art, in order to improve the modulation 

25 activity or the binding affinity of the ligand. This process can lead to second 
generation ligands with improved properties. 

Ligands also can be selected that modulate PPAR responsive gene 

r 

transcription by the method of altering the interaction of co-activators and co- 
repressors with their cognate PPAR. For example, agonistic ligands can be 
30 selected that block or dissociate a co-repressor from interacting with the 
PPAR, and/or that promote binding or association of a co-activator. 
Antagonistic ligands can be selected that block co-activator interaction and/or 
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promote co-repressor interaction with a target receptor. Selection can be 



modulatory properties. Preferably, interactions of a PPARa polypeptide are 
targeted. Suitable assays for screening that can be employed, mutatis 
5 mutandis in the present invention, are described in as described in Nichols et 
aK, (1998) Anal. Biochem. 257: 112-19 and Xuetal., (1999) Mol. Cell 3: 397- 
403, which are incorporated herein in their entirety by reference. 



The design of candidate substances, also referred to as "compounds" 
or "candidate compounds", that bind to or inhibit PPAR LBD-mediated activity 
according to the present invention generally involves consideration of two 
factors. First, the compound must be capable of physically and structurally 

15 associating with a PPAR LBD. Non-covalent molecular interactions important 
in the association of a PPAR LBD with its substrate include hydrogen 
bonding, van der Waals interactions and hydrophobic interactions. 

Second, the compound must be able to assume a conformation that 
allows it to associate with a PPAR LBD. Although certain portions of the 

20 compound will not directly participate in this association with a PPAR LBD, 
those portions can still influence the overall conformation of the molecule. 
This, in turn, can have a significant impact on potency. Such conformational 
requirements include the overall three-dimensional structure and orientation of 
the chemical entity or compound in relation to all or a portion of the binding 

* 

25 site, e.g., the ligand binding pocket or an accessory binding site of a PPAR 
LBD, or the spacing between functional groups of a compound comprising 
several chemical entities that directly interact with a PPAR LBD. 

The potential modulatory or binding effect of a chemical compound on 
a PPAR LBD can be analyzed prior to its actual synthesis and testing by the 

30 use of computer modeling techniques that employ the coordinates of a 
crystalline PPARa LBD polypeptide of the present invention. If the theoretical 
structure of the given compound suggests insufficient interaction and 




VIII.A.3. 



Methods of Designing PPARa LBD Modulator 
Compounds 
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association between it and a PPAR LBD, synthesis and testing of the 
compound is obviated. However, if computer modeling indicates a strong 
interaction, the molecule can then be synthesized and tested for its ability to 
bind and modulate the activity of a PPAR LBD. In this manner, synthesis of 
5 unproductive or inoperative compounds can be avoided. 

A modulatory or other binding compound of a PPAR LBD polypeptide 
(preferably a PPARa LBD) can be computationally evaluated and designed 
via a series of steps in which chemical entities or fragments are screened and 
selected for their ability to associate with the individual binding sites or other 

1 0 areas of a crystalline PPARaLBD polypeptide of the present invention. 

One of several methods can be used to screen chemical entities or 
fragments for their ability to associate with a PPAR LBD and, more 
particularly, with the individual binding sites of a PPAR LBD, such as ligand 
binding pocket or an accessory binding site. This process can begin by visual 

1 5 inspection of, for example, the ligand binding pocket on a computer screen 
based on the PPARa LBD atomic coordinates in Table 2. Selected fragments 
or chemical entities can then be positioned in a variety of orientations, or 
docked, within an individual binding site of a PPARa LBD as defined herein 
above. Docking can be accomplished using software programs such as those 

20 available under the tradenames QUANTA™ (Molecular Simulations Inc., San 
Diego, California) and SYBYL™ (Tripos, Inc., St. Louis, Missouri), followed by 
energy minimization and molecular dynamics with standard molecular 
mechanics forcefields, such as CHARM ( Brooks et al. , (1983) J. Comp. 
Chem., 8: 132) and AMBER 5 (Case et al. , (1997), AMBER 5, University of 

25 California, San Francisco; Peariman et al„ (1995) Comput. Phys. Commun. 
91: 1^1). 

Specialized computer programs can also assist in the process of 
selecting fragments or chemical entities. These include: 

1. GRID™ program, version 17 ( Goodford , (1985) J. Med. Chem. 28: 
30 849-57), which is available from Molecular Discovery Ltd., Oxford, UK; 

2. MCSS™ program (Miranker & Karplus , (1991) Proteins 11: 29-34), 
which is available from Molecular Simulations, Inc., San Diego, California; 
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3. AUTODOCK™ 3.0 program (Goodsell & Olsen , (1990) Proteins 8: 
195-202), which is available from the Scripps Research Institute, La Jolla, 
California; 

4. DOCK™ 4.0 program (Kuntz et at. , (1992) J. Mol. Biol. 161: 269- 
5 88), which is available from the University of California, San Francisco, 

California; 

5. FLEX-X™ program (See, Rarey et al. , (1996) J. Comput. Aid. Mol. 
Des. 10:41-54), which is available from Tripos, Inc., St. Louis, Missouri; 

« 

6. MVP program (Lambert , (1997) in Practical Application of 
10 Computer-Aided Drug Design . (Charifson, ed.) Marcel-Dekker, New York, pp. 

243-303); and 

7. LUDI™ program (Bohm , (1992) J. Comput. Aid. Mol. Des., 6: 61- 
78), which is available from Molecular Simulations, Inc., San Diego, California. 

Once suitable chemical entities or fragments have been selected, they 
15 can be assembled into a single compound or modulator. Assembly can 
proceed by visual inspection of the relationship of the fragments to each other 
on the three-dimensional image displayed on a computer screen in relation to 
the structure coordinates of a PPARa LBD. Manual model building using 
software such as QUANTA™ or SYBYL™ typically follows. 
20 Useful programs to aid one of ordinary skill in the art in connecting the 

individual chemical entities or fragments include: 

. 1. CAVEAT™ program (Bartlett et al. , (1989) Spec/a/ Pub., Royal 
Chem. Soc. 78: 182-96), which is available from the University of California, 
Berkeley, California; 

25 2. 3D Database systems, such as MACCS-3D™ system program, 

which is available from MDL Information Systems, San Leandro, California. 
This area is reviewed in Martin , (1992) J. Med. Chem. 35: 2145-54; and 

3. HOOK™ program (Eisen et al. , (1994). Proteins 19: 199-221), 
which is available from Molecular Simulations, Inc., San Diego, California. 

30 Instead of proceeding to build a PPAR LBD modulator (preferably a 

PPARa LBD modulator) in a step-wise fashion one fragment or chemical 
entity at a time as described above, modulatory or other binding compounds 
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can be designed as a whole or de novo using the structural coordinates of a 
crystalline PPARa LBD polypeptide of the present invention and either an 
empty binding site or optionally including some portion(s) of a known 
modulator(s). Applicable methods can employ the following software 
5 programs: 

1. LUDI™ program (Bohm, (1992) J. Comput. Aid. MoL Des., 6: 61- 
78), which is available from Molecular Simulations, Inc., San Diego, California; 

2. LEGEND™ program (Nishibata & Itai , (1991) Tetrahedron 47: 
8985); and 

10 3. LEAPFROG™, which is available from Tripos Associates, St. Louis, 

Missouri. 

Other molecular modeling techniques can also be employed in 
accordance with this invention. See, e.g. , Cohen et al. , (1990) J. Med. Chem. 
33: 883-94. See also , Navia & Murcko , (1992) Curr. Opin. Struc. Biol. 2: 202- 

15 10; U.S. Patent No. 6,008,033, herein incorporated by reference. 

Once a compound has been designed or selected by the above 
methods, the efficiency with which that compound can bind to a PPARa LBD 
can be tested and optimized by computational evaluation. By way of 
particular example, a compound that has been designed or selected to 

20 function as a PPARa LBD modulator should also preferably traverse a volume 
not overlapping that occupied by the binding site when it is bound to its native 
ligand. Additionally, an effective PPAR LBD modulator should preferably 
demonstrate a relatively small difference in energy between its bound and 
free states (i.e., a small deformation energy of binding). Thus, the most 

25 efficient PPAR LBD modulators should preferably be designed with a 
deformation energy of binding of not greater than about 10 kcal/mole, and 
preferably, not greater than 7 kcal/mole. It is possible for PPAR LBD 
modulators to interact with the polypeptide in more than one conformation that 
is similar in overall binding energy. In those cases, the deformation energy of 

30 binding is taken to be the difference between the energy of the free compound 
and the average energy of the conformations observed when the modulator 
binds to the polypeptide. 
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A compound designed or selected as binding to a PPAR polypeptide 
(preferably a PPARcc LBD polypeptide) can be further computationally 
optimized so that in its bound state it would preferably lack repulsive 
electrostatic interaction with the target polypeptide. Such non-complementary 
(e.g., electrostatic) interactions include repulsive charge-charge, dipole-dipole 
and charge-dipole interactions. Specifically, the sum of all electrostatic 
interactions between the modulator and the polypeptide when the modulator 
is bound to a PPAR LBD preferably make a neutral or favorable contribution 
to the enthalpy of binding. 

Specific computer software is available in the art to evaluate compound 
deformation energy and electrostatic interaction. Examples of programs 
designed for such uses include: 

1. Gaussian 98™, which is available from Gaussian, Inc., Pittsburgh, 
Pennsylvania; 

2. AMBER™ program, version 6.0, which is available from the 
University of California at San Francisco; 

3. QUANTA™ program, which is available from Molecular Simulations, 
Inc., San Diego, California; 

4. ' CHARMm® program, which is available from Molecular Simulations, 
Inc., San Diego, California; and 

4. Insight II® program, which is available from Molecular Simulations, 
Inc., San Diego, California. 

These programs can be implemented using a suitable computer 
system. Other hardware systems and software packages will be apparent to 
those skilled in the art after review of the disclosure of the present invention 
presented herein. 

Once a PPAR LBD modulating compound has been optimally selected 
or designed, as described above, substitutions can then be made in some of 
its atoms or side groups in order to improve or modify its binding properties. 
Generally, initial substitutions are conservative, i.e., the replacement group 
will have approximately the same size, shape, hydrophobicity and charge as 
the original group. It should, of course, be understood that components 
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known in the art to alter conformation should be avoided. Such substituted 
chemical compounds can then be analyzed for efficiency of fit to a PPAR LBD 
binding site using the same computer-based approaches described in detail 
above. 

5 VII l.B. Distinguishing Between PPAR Subtypes 

The present invention discloses the ability to generate new synthetic 
ligands to distinguish between PPAR subtypes. As described herein, 
computer-designed ligands can be generated that distinguish between PPAR 
subtypes, thereby allowing the generation of either tissue specific or function 

10 specific ligands. The atomic structural coordinates disclosed in the present 
invention reveal structural details unique to PPARoc. These structural details 
can be exploited when a novel ligand is designed using the methods of the 
present invention or other ligand design methods known in the art. The 
structural features that differentiate, for example, a PPARct from a PPARy can 

15 be targeted in ligand design. Thus, for example, a ligand can be designed 
that will recognize PPARa, while not interacting with other PPARs or even 
with moieties having similar structural features. Prior to the disclosure of the 
present invention, the ability to target a PPAR subtype was unattainable. 

The present invention also pertains to a method for designing an 

20 agonist or modulator with desired levels of activity on the three subtypes, 
PPARa, PPARy and PPARS. In a preferred embodiment, the method 
comprises obtaining atomic coordinates for structures of the PPARa, PPARy 
and/or PPARS ligand binding domains. The structures can comprise PPARa, 
PPARy and PPARS each bound to various different ligands, and also can 

25 comprise structures where no ligand is present. The structures can also 
comprise models where a compound has been docked into a particular PPAR 
using a molecular docking procedure, such as the MVP program disclosed 
herein. Optionally, the structures comprise rotated and translated so as to 
superimpose corresponding Ca or backbone atoms; this facilitates the 

30 comparison of structures. 

The PPARa, PPARy and PPARS structures can also be compared 
using a computer graphics system to identify regions of the ligand binding site 
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that have similar shape and electrostatic character, and to identify regions of 
the ligand binding site that are narrowed or constricted in one or two of the 
PPARs compared with the other(s). Since these three PPARs are subject to 
conformational changes, attention is paid to the range of motion observed for 
5 each protein atom over the whole collection of structures. The ligand 
structures, including both those determined by X-ray crystallography and 
those modeled using molecular docking procedures, can be examined using a 
computer graphics system to identify ligands where a chemical modification 
. could increase or decrease binding to a particular PPAR, or decrease activity 

10 against a particular PPAR. Additionally or alternatively, the chemical 
modification can introduce a group into a volume that is normally occupied by 
an atom of that PPAR. 

Optionally, to selectively decrease activity against a particular PPAR, 
the chemical modification can be made so as to occupy volume that is 

15 normally occupied by atoms of that particular PPAR, but not by atoms of the 
other PPARs. To increase activity against a particular PPAR, a chemical 
modification can be made that improves interactions with that particular 
PPAR. To selectively increase activity against a particular PPAR, a chemical 
modification can be made that improves the interactions with that particular 

20 PPAR, but does not improve the interactions with the other PPARs. Other 
design principles can also be used to increase or decrease activity on a 
particular PPAR. 

Thus, various possible compounds and chemical modfications can be 
considered and compared graphically, and with molecular modeling tools, for 
25 synthetic feasibility and likelihood of achieving the desired profile of activation 
of PPARa, PPARy and PPAR5. Compounds that appear synthetically 
feasible and that have a good likelihood of achieving the desired profile are 
synthesized. The compounds can then be tested for binding and/or activation 
of PPARa, PPARy and PPAR5, and tested for their overall biological effect. 

30 
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VIII.C. Method of Screening for Chemical and Biological Modulators of 

the Biological Activity of PPARa 
A candidate substance identified according to a screening assay of the 
present invention has an ability to modulate the biological activity of a PPAR 
5 or a PPAR LBD polypeptide. In a preferred embodiment, such a candidate 
compound can have utility in the treatment of disorders and conditions 
associated with the biological activity of a PPARa or a PPARa LBD 
polypeptide, including lipid homeostasis. 

In a cell-free system, the method comprises the steps of establishing a 
10 control system comprising a crystalline PPARa polypeptide and a ligand 
which is capable of binding to the polypeptide; establishing a test system 
comprising a crystalline PPARa polypeptide, the ligand, and a candidate 
compound; and determining whether the candidate compound modulates the 
activity of the polypeptide by comparison of the test and control systems. A 
1 5 representative ligand comprises Compound 1 or other small molecule, and in 
this embodiment, the biological activity or property screened includes binding 
affinity. 

In another embodiment of the invention, a crystalline form of a 
PPARa polypeptide or a catalytic or immunogenic fragment or oligopeptide 

20 thereof, can be used for screening libraries of compounds in any of a variety 
of drug screening techniques. The fragment employed in such a screening 
can be affixed to a solid support. The formation of binding complexes, 
between a crystalline PPARa polypeptide and the agent being tested, will be 
detected. In a preferred embodiment, the crystalline PPARa polypeptide has 

25 an amino acid sequence of SEQ ID NO:2. When a PPARa LBD polypeptide 
is employed, a preferred embodiment will include a crystalline PPARa 
polypeptide having the amino acid sequence of SEQ ID NO:4. 

Another technique for drug screening which can be used provides for 
high throughput screening of compounds having suitable binding affinity to the 

30 protein of interest as described in published PCT application WO 84/03564, 
herein incorporated by reference. In this method, as applied to a crystalline 
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polypeptide of the present invention, large numbers of different small test 
compounds are synthesized on a solid substrate, such as plastic pins or some 
other surface. The test compounds are reacted with the crystalline 
polypeptide, or fragments thereof. Bound polypeptide is then detected by 
5 methods well known to those of skill in the art. The crystalline polypeptide 
can also be placed directly onto plates for use in the aforementioned drug 
screening techniques. 

In yet another embodiment, a method of screening for a modulator of a 
PPARa or a PPARa LBD polypeptide comprises: providing a library of test 

10 samples; contacting a crystalline form of PPARa or a crystalline form of an 
PPARa LBD polypeptide with each test sample; detecting an interaction 
between a test sample and a crystalline form of a PPARa or a crystalline form 
of a PPARa LBD polypeptide; identifying a test sample that interacts with a 
crystalline form of a PPARa or a crystalline form of a PPARa LBD 

15 polypeptide; and isolating a test sample that interacts with a crystalline form of 
a PPARa or a crystalline form of a PPARa LBD polypeptide. 

In each of the foregoing embodiments, an interaction can be detected 
spectrophotometrically, radiologically or immunologically. An interaction 
between a crystalline form of a PPARa or a crystalline form of a PPARa LBD 

20 polypeptide and a test sample can also be quantified using methodology 
known to those of skill in the art. 

In accordance with the present invention there is also provided a rapid 
and high throughput screening method that relies on the methods described 
above. This screening method comprises separately contacting each of a 

25 plurality of substantially identical samples with crystalline form of a PPARa or 
a crystalline form of a PPARa LBD and detecting a resulting binding complex. 
In such a screening method the plurality of samples preferably comprises 
more than about 10 4 samples, or more preferably comprises more than about 
5x 10 4 samples. 

30 

VIII. D. Method of Identifying Compounds Which Inhibit Ligand Binding 
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In one aspect of the present invention, an assay method for identifying 
a compound that inhibits binding of a ligand to a PPAR polypeptide is 
disclosed. A ligand of PPARa, such as Compound 1, can be used in the 
assay method as the ligand against which the inhibition by a test compound is 
5 gauged. The method comprises (a) incubating a PPAR polypeptide with a 
ligand in the presence of a test inhibitor compound; (b) determining an 
amount of ligand that is bound to the PPAR polypeptide, wherein decreased 
binding of ligand to the PPAR polypeptide in the presence of the test inhibitor 
compound relative to binding in the absence of the test inhibitor compound is 
10 indicative of inhibition; and (c) identifying the test compound as an inhibitor of 
ligand binding if decreased ligand binding is observed. Preferably, the ligand 
is Compound 1. 

In another aspect of the present invention, the disclosed assay method 
can be used in the structural refinement of candidate PPAR inhibitors. For 
15 example, multiple rounds of optimization can be followed by gradual structural 
changes in a strategy of inhibitor design. A strategy such as this is made 
possible by the disclosure of the coordinates of the PPARa LBD. 

I* Design, Preparation and Structural Analysis of Additional PPARa and 

20 PPARa LBD Mutants and Structural Equivalents 

The present invention provides for the generation of PPAR and PPAR 
mutants (preferably PPARa and PPARa LBD mutants), and the ability to 
solve the crystal structures of those that crystallize. More particularly, through 
the provision of the three-dimensional structure of a PPARa LBD, desirable 

25 sites for mutation can be identified. 

The structure coordinates of a PPARa LBD provided in accordance 
with the present invention also facilitate the identification of related proteins or 
enzymes analogous to PPARa in function, structure or both, (for example, a 
PPARy) which can lead to novel therapeutic modes for treating or preventing 

30 a range of disease states. 
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IX.A. Sterically Similar Compounds 

A further aspect of the present invention is that sterically similar 
compounds can be formulated to mimic the key portions of a PPAR LBD 
structure. Such compounds are functional equivalents. The generation of a 
5 structural functional equivalent can be achieved by the techniques of 
modeling and chemical design known to those of skill in the art and described 
herein. Modeling and chemical design of PPAR and PPAR LBD structural 
equivalents can be based on the structure coordinates of a crystalline PPARa 
LBD polypeptide of the present invention. It will be understood that all such 
1 0 sterically similar constructs fall within the scope of the present invention. 

IX.B. PPARa Polypeptides 

The generation of chimeric PPAR polypeptides is also an aspect of the 
present invention. Such a chimeric polypeptide can comprise a PPAR LBD 

1 5 polypeptide or a portion of a PPAR LBD, (e.g. a PPARa LBD) that is fused to 
a candidate polypeptide or a suitable region of the candidate polypeptide, for 
example PPARy. Throughout the present disclosure it is intended that the 
term "mutant" encompass not only mutants of a PPAR LBD polypeptide but 
chimeric proteins generated using a PPAR LBD as well. It is thus intended 

20 that the following discussion of mutant PPAR LBDs apply mutatis mutandis to 
chimeric PPAR and PPAR LBD polypeptides and to structural equivalents 
thereof. 

In accordance with the present invention, a mutation can be directed to 
a particular site or combination of sites of a wild-type PPAR LBD. For 

25 example, an accessory binding site or the binding pocket can be chosen for 
mutagenesis. Similarly, a residue having a location on, at or near the surface 
of the polypeptide can be replaced, resulting in an altered surface charge of 
one or more charge units, as compared to the wild-type PPAR and PPAR 
LBD. Alternatively, an amino acid residue in a PPAR or a PPAR LBD can be 

30 chosen for replacement based on its hydrophilic or hydrophobic 
characteristics. 
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Such mutants can be characterized by any one of several different 
properties as compared with the wild-type PPAR LBD. For example, such 
mutants can have an altered surface charge of one or more charge units, or 
can have an increase in overall stability. Other mutants can have altered 
5 substrate specificity in comparison with, or a higher specific activity than, a 
wild-type PPAR or PPAR LBD. 

PPAR and PPAR LBD mutants of the present invention can be 
generated in a number of ways. For example, the wild-type sequence of a 
PPAR or a PPAR LBD can be mutated at those sites identified using this 

10 invention as desirable for mutation, by means of oligonucleotide-directed 
mutagenesis or other conventional methods, such as deletion. Alternatively, 
mutants of a PPAR or a PPAR LBD can be generated by the site-specific 
replacement of a particular amino acid with an unnaturally occurring amino 
acid. In addition, PPAR or PPAR LBD mutants can be generated through 

1 5 replacement of an amino acid residue, for example, a particular cysteine or 
methionine residue, with selenocysteine or selenomethionine. This can be 
achieved by growing a host organism capable of expressing either the wild- 
type or mutant polypeptide on a growth medium depleted of either natural 
cysteine or methionine (or both) but enriched in selenocysteine or 

20 selenomethionine (or both). 

Mutations can be introduced into a DNA sequence coding for a PPAR 
or a PPAR LBD using synthetic oligonucleotides. These oligonucleotides 
contain nucleotide sequences flanking the desired mutation sites. Mutations 
can be generated in the full-length DNA sequence of a PPAR or a PPAR LBD 

25 or in any sequence coding for polypeptide fragments of a PPAR or a PPAR 
LBD. 

According to the present invention, a mutated PPAR or PPAR LBD 
DNA sequence produced by the methods described above, or any alternative 
methods known in the art, can be expressed using an expression vector. An 
30 expression vector, as is well known to those of skill in the art, typically 
includes elements that permit autonomous replication in a host cell 
independent of the host genome, and one or more phenotypic markers for 
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selection purposes. Either prior to or after insertion of the DNA sequences 
surrounding the desired PPAR or PPAR LBD mutant coding sequence, an 
expression vector also will include control sequences encoding a promoter, 
operator, ribosome binding site, translation initiation signal, and, optionally, a 
5 repressor gene or various activator genes and a signal for termination. In 
some embodiments, where secretion of the produced mutant is desired, 
nucleotides encoding a "signal sequence" can be inserted prior to a PPAR or 
a PPAR LBD mutant coding sequence. For expression under the direction of 
the control sequences, a desired DNA sequence must be operatively linked to 

10 the control sequences; that is, the sequence must have an appropriate start 
signal in front of the DNA sequence encoding the PPAR or PPAR LBD 
mutant, and the correct reading frame to permit expression of that sequence 
under the control of the control sequences and production of the desired 
product encoded by that PPAR or PPAR LBD sequence must be maintained. 

15 Any of a wide variety of well-known available expression vectors can 

be useful to express a mutated PPAR or PPAR LBD coding sequences of this 
invention. These include for example, vectors consisting of segments of 
chromosomal, non-chromosomal and synthetic DNA sequences, such as 
various known derivatives of SV40, known bacterial plasmids, e.g., plasmids 

20 from £. coli including col E1 f pCR1, pBR322, pMB9 and their derivatives, 
wider host range plasmids, e.g., RP4, phage DNAs, e.g., the numerous 
derivatives of phage X, e.g., NM 989, and other DNA phages, e.g., M13 and 
filamentous single stranded DNA phages, yeast plasmids and vectors derived 
from combinations of plasmids and phage DNAs, such as plasmids which 

25 have been modified to employ phage DNA or other expression control 
sequences. In the preferred embodiments of this invention, vectors amenable 
to expression in a pRSETA-based expression system are employed. The 
pRSETA expression system is available from Invitrogen, Inc., Carlsbad, 
California. 

30 In addition, any of a wide variety of expression control sequences — 

sequences that control the expression of a DNA sequence when operatively 
linked to it— can be used in these vectors to express the mutated DNA 
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sequences according to this invention. Such useful expression control 
sequences, include, for example, the early and late promoters of SV40 for 
animal cells, the lac system, the trp system the TAC or TRC system, the 
major operator and promoter regions of phage X, the control regions of fd coat 
5 protein, all for E. coh\ the promoter for 3-phosphoglycerate kinase or other 
glycolytic enzymes, the promoters of acid phosphatase, e.g., Pho5, the 
promoters of the yeast a-mating factors for yeast, and other sequences 
known to control the expression of genes of prokaryotic or eukaryotic cells or 
their viruses, and various combinations thereof. 

10 A wide variety of hosts are also useful for producing mutated 

PPARaand PPARot LBD polypeptides according to this invention. These 
hosts include, for example, bacteria, such as E. coll Bacillus and 
Streptomyces, fungi, such as yeasts, and animal cells, such as CHO and 
COS-1 cells, plant cells, insect cells, such as Sf9 cells, and transgenic host 

15 cells. 

It should be understood that not all expression vectors and expression 
systems function in the same way to express mutated DNA sequences of this 
invention, and to produce modified PPAR and PPAR LBD polypeptides or 
PPAR or PPAR LBD mutants. Neither do all hosts function equally well with 

20 the same expression system. One of skill in the art can, however, make a 
selection among these vectors, expression control sequences and hosts 
without undue experimentation and without departing from the scope of this 
invention. For example, an important consideration in selecting a vector will 
be the ability of the vector to replicate in a given host. The copy number of 

25 the vector, the ability to control that copy number, and the expression of any 
other proteins encoded by the vector, such as antibiotic markers, should also 
be considered. 

* 

In selecting an expression control sequence, a variety of factors should 
also be considered. These include, for example, the relative strength of the 
30 system, its controllability and its compatibility with the DNA sequence 
encoding a modified PPAR or PPAR LBD polypeptide of this invention, with 
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particular regard to the formation of potential secondary and tertiary 
structures. 

Hosts should be selected by consideration of their compatibility with the 
chosen vector, the toxicity of a modified PPAR or PPAR LBD to them, their 
5 ability to express mature products, their ability to fold proteins correctly, their 
fermentation requirements, the ease of purification of a modified PPAR or 
PPAR LBD and safety. Within these parameters, one of skill in the art can 
select various vector/expression control system/host combinations that will 
produce useful amounts of a mutant PPAR or PPAR LBD. A mutant PPAR or 

10 PPAR LBD produced in these systems can be purified by a variety of 
conventional steps and strategies, including those used to purify the wild-type 
PPAR or PPAR LBD. 

Once a PPAR LBD mutation(s) has been generated in the desired 
location, such as an active site or dimerization site, the mutants can be tested 

15 for any one of several properties of interest. For example, mutants can be 
screened for an altered charge at physiological pH. This is determined by 
measuring the mutant PPAR or PPAR LBD isoelectric point (pi) and 
comparing the observed value with that of the wild-type parent. Isoelectric 
point can be measured by gel-electrophoresis according to the method of 

20 Wellner (Wellner , (1971) Anal. Chem. 43: 597). A mutant PPAR or PPAR 
LBD polypeptide containing a replacement amino acid located at the surface 
of the enzyme, as provided by the structural information of this invention, can 
lead to an altered surface charge and an altered pi. 

25 IX.C. Generation of an Engineered PPARg LBD or PPARa LBD 

Mutant 

In another aspect of the present invention, a unique PPAR or PPAR 
LBD polypeptide can be generated. Such a mutant can facilitate purification 
and the study of the ligand-binding abilities of a PPAR polypeptide. 
30 As used in the following discussion, the terms "engineered PPAR", 

"engineered PPAR LDB", "PPAR mutant", and "PPAR LBD mutant" refers to 
polypeptides having amino acid sequences which contain at least one 
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mutation in the wild-type sequence. The terms also refer to PPAR and PPAR 
LBD polypeptides which are capable of exerting a biological effect in that they 
comprise all or a part of the amino acid sequence of an engineered PPAR or 
PPAR LBD mutant polypeptide of the present invention, or cross-react with 
5 antibodies raised against an engineered PPAR or PPAR LBD mutant 
polypeptide, or retain all or some or an enhanced degree of the biological 
activity of the engineered PPAR or PPAR LBD mutant amino acid sequence 
or protein. Such biological activity can include the binding of small molecules 
in general, and the binding of Compound 1 in particular. 

10 The terms "engineered PPAR LBD" and "PPAR LBD mutant" also 

includes analogs of an engineered PPAR LBD or PPAR LBD mutant 
polypeptide. By "analog" is intended that a DNA or polypeptide sequence can 
contain alterations relative to the sequences disclosed herein, yet retain all or 
some or an enhanced degree of the biological activity of those sequences. 

1 5 Analogs can be derived from genomic nucleotide sequences or from other 
organisms, or can be created synthetically. Those of skill in the art will 
appreciate that other analogs, as yet undisclosed or undiscovered, can be 
used to design and/or construct PPAR LBD or PPAR LBD mutant analogs. 
There is no need for an engineered PPAR LBD or PPAR LBD mutant 

20 polypeptide to comprise all or substantially all of the amino acid sequence of 
SEQ ID NOs:2 or 4. Shorter or longer sequences are anticipated to be of use 
in the invention; shorter sequences are herein referred to as "segments". 
Thus, the terms "engineered PPAR LBD" and "PPAR LBD mutant" also 
includes fusion, chimeric or recombinant engineered PPAR LBD or PPAR 

25 LBD mutant polypeptides and proteins comprising sequences of the present 
invention. Methods of preparing such proteins are disclosed herein above 
and are known in the art. 

IX.D. Sequence Similarity and Identity 
30 As used herein, the term "substantially similar" means that a particular 

sequence varies from nucleic acid sequence of SEQ ID NOs:1 or 3, or the 
amino acid sequence of SEQ ID NOs:2 or 4 by one or more deletions, 
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substitutions, or additions, the net effect of which is to retain at least some of 
biological activity of the natural gene, gene product, or sequence. Such 
sequences include "mutant" or "polymorphic" sequences, or sequences in 
which the biological activity and/or the physical properties are altered to some 
5 degree but retains at least some or an enhanced degree of the original 
biological activity and/or physical properties. In determining nucleic acid 
sequences, all subject nucleic acid sequences capable of encoding 
substantially similar amino acid sequences are considered to be substantially 
similar to a reference nucleic acid sequence, regardless of differences in 
10 codon sequences or substitution of equivalent amino acids to create 
biologically functional equivalents. 

IX.D.1. Sequences That are Substantially Identical to an 

Engineered PPAR or PPAR LBD Mutant Sequence of the 

15 Present Invention 

Nucleic acids that are substantially identical to a nucleic acid sequence 
of an engineered PPAR or PPAR LBD mutant of the present invention, e.g. 
allelic variants, genetically altered versions of the gene, etc., bind to an 
engineered PPAR or PPAR LBD mutant sequence under stringent 

20 hybridization conditions. By using probes, particularly labeled probes of DNA 
sequences, one can isolate homologous or related genes. The source of 
homologous genes can be any species, e.g. primate species; rodents, such 
as rats and mice, canines, felines, bovines, equines, yeast, nematodes, etc. 
Between mammalian species, e.g. human and mouse, homologs have 

25 substantial sequence similarity, i.e. at least 75% sequence identity between 
nucleotide sequences. Sequence similarity is calculated based on a 
reference sequence, which can be a subset of a larger sequence, such as a 
conserved motif, coding region, flanking region, etc. A reference sequence 
will usually be at least about 18 nt long, more usually at least about 30 nt long, 

30 and can extend to the complete sequence that is being compared. Algorithms 
for sequence analysis are known in the art, such as BLAST, described in 
Altschul et al. , (1990) J. Mol. Biol. 215: 403-10. 
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Percent identity or percent similarity of a DNA or peptide sequence can 
be determined, for example, by comparing sequence information using the 
GAP computer program, available from the University of Wisconsin Geneticist 
Computer Group. The GAP program utilizes the alignment method of 
5 Needleman et al. . (1970) J. Mol. Biol. 48: 443, as revised by Smith et al. , 
(1981) Adv. Appl. Math. 2:482. Briefly, the GAP program defines similarity as 
the number of aligned symbols (i.e., nucleotides or amino acids) which are 
similar, divided by the total number of symbols in the shorter of the two 
sequences. The preferred parameters for the GAP program are the default 

1 0 parameters, which do not impose a penalty for end gaps. See, e.g. , Schwartz 
et al. , eds., (1979), Atlas of Protein Sequence and Structure, National 
Biomedical Research Foundation , pp. 357-358, and Gribskov et al. , (1986) 
Nucl. Acids. Res. 14: 6745. 

The term "similarity" is contrasted with the term "identity". Similarity is 

15 defined as above; "identity", however, means a nucleic acid or amino acid 
sequence having the same amino acid at the same relative position in a given 
family member of a gene family. Homology and similarity are generally 
viewed as broader terms than the term identity. Biochemically similar amino 
acids, for example leucine/isoleucine or glutamate/aspartate, can be present 

20 at the same position— these are not identical per se, but are biochemically 
"similar." As disclosed herein, these are referred to as conservative 
differences or conservative substitutions. This differs from a conservative 
mutation at the DNA level, which changes the nucleotide sequence without 
making a change in the encoded amino acid, e.g. TCC to TCA, both of which 

25 encode serine. 

As used herein, DNA analog sequences are "substantially identical" to 
specific DNA sequences disclosed herein if: (a) the DNA analog sequence is 
derived from coding regions of the nucleic acid sequence shown in SEQ ID 
NOs:1 or 3; or (b) the DNA analog sequence is capable of hybridization with 

30 DNA sequences of (a) under stringent conditions and which encode a 
biologically active PPARot or PPARa LBD gene product; or (c) the DNA 
sequences are degenerate as a result of alternative genetic code to the DNA 
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analog sequences defined in (a) and/or (b). Substantially identical analog 
proteins and nucleic acids will have between about 70% and 80%, preferably 
between about 81% to about 90% or even more preferably between about 
91% and 99% sequence identity with the corresponding sequence of the 
5 native protein or nucleic acid. Sequences having lesser degrees of identity 
but comparable biological activity are considered to be equivalents. 

As used herein, "stringent conditions" means conditions of high 
stringency, for example 6XSSC, 0.2% polyvinylpyrrolidone, 0.2% Ficoll, 0.2% 
bovine serum albumin, 0.1% sodium dodecyl sulfate, 100 ^ig/ml salmon sperm 

10 DNA and 15% formamide at 68°C. For the purposes of specifying additional 
conditions of high stringency, preferred conditions are salt concentration of 
about 200 mM and temperature of about 45°C. One example of such 
stringent conditions is hybridization at 4XSSC, at 65°C, followed by a washing 
in 0.1XSSC at 65°C for one hour. Another exemplary stringent hybridization 

1 5 scheme uses 50% formamide, 4XSSC at 42°C. 

In contrast, nucleic acids having sequence similarity are detected by 
hybridization under lower stringency conditions. Thus, sequence identity can 
be determined by hybridization under lower stringency conditions, for 
example, at 50°C or higher and 0.1XSSC (9 mM NaCI/0.9 mM sodium citrate) 

20 and the sequences will remain bound when subjected to washing at 55°C in 
1XSSC. 

IX.D.2. Complementarity and Hybridization to an Engineered 

PPARg or PPARg LBD Mutant Sequence 
25 As used herein, the term "complementary sequences" means nucleic 

acid sequences which are base-paired according to the standard Watson- 
Crick complementarity rules. The present invention also encompasses the 
use of nucleotide segments that are complementary to the sequences of the 
present invention. 

30 Hybridization can also be used for assessing complementary 

sequences and/or isolating complementary nucleotide sequences. As 
discussed above, nucleic acid hybridization will be affected by such conditions 
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as salt concentration, temperature, or organic solvents, in addition to the base 
composition, length of the complementary strands, and the number of 
nucleotide base mismatches between the hybridizing nucleic acids, as will be 
readily appreciated by those skilled in the art. Stringent temperature 
5 conditions will generally include temperatures in excess of about 30°C, 
typically in excess of about 37°C, and preferably in excess of about 45°C. 
Stringent salt conditions will ordinarily be less than about 1,000 mM, typically 
less than about 500 mM, and preferably less than about 200 mM. However, 
the combination of parameters is much more important than the measure of 

10 any single parameter. See, e.g. , Wetmur & Davidson , (1968) J. Mol. Biol. 31: 
349-70. Determining appropriate hybridization conditions to identify and/or 
isolate sequences containing high levels of homology is well known in the art. 
See, e.g. , Sambrook et al. , (1989) Molecular Cloning: A Laboratory Manual , 
Cold Spring Harbor, New York. 

15 IX.D.3. Functional Equivalents of an Engineered PPARa or 

PPARg LBD Mutant Nucleic Acid Sequence of the 
Present Invention 

As used herein, the term "functionally equivalent codon" is used to refer 
to codons that encode the same amino acid, such as the ACG and AGU 

20 codons for serine. PPARa or PPARa LBD-encoding nucleic acid sequences 
comprising SEQ ID NOs:1 and 3 which have functionally equivalent codons 
are covered by the present invention. Thus, when referring to the sequence 
example presented in SEQ ID NOs:1 and 3, applicants contemplate 
substitution of functionally equivalent codons into the sequence example of 

25 SEQ ID NOs:1 and 3. Thus, applicants are in possession of amino acid and 
nucleic acids sequences which include such substitutions but which are not 
set forth herein in their entirety for convenience. 

It will also be understood by those of skill in the art that amino acid and 
nucleic acid sequences can include additional residues, such as additional N- 

30 or C-terminal amino acids or 5' or 3' nucleic acid sequences, and yet still be 
essentially as set forth in one of the sequences disclosed herein, so long as 
the sequence retains biological protein activity where polypeptide expression 
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is concerned. The addition of terminal sequences particularly applies to 
nucleic acid sequences which can, for example, include various non-coding 
sequences flanking either of the 5' or 3' portions of the coding region or can 
include various internal sequences, i.e., introns, which are known to occur 
5 within genes. 

IX.D.4. Biological Equivalents 

The present invention envisions and includes biological equivalents of 
a engineered PPAR or PPAR LBD mutant polypeptide of the present 
10 invention. The term "biological equivalent" refers to proteins having amino 
. acid sequences which are substantially identical to the amino acid sequence 
of an engineered PPAR LBD mutant of the present invention and which are 
capable of exerting a biological effect in that they are capable of binding small 
molecules or cross-reacting with anti- PPAR or PPAR LBD mutant antibodies 
15 raised against an engineered mutant PPAR or PPAR LBD polypeptide of the 
present invention. 

For example, certain amino acids can be substituted for other amino 
acids in a protein structure without appreciable loss of interactive capacity 
with, for example, structures in the nucleus of a cell. Since it is the interactive 

20 capacity and nature of a protein that defines that protein's biological functional 
activity, certain amino acid sequence substitutions can be made in a protein 
sequence (or the nucleic acid sequence encoding it) to obtain a protein with 
the same, enhanced, or antagonistic properties. Such properties can be 
achieved by interaction with the normal targets of the protein, but this need 

25 not be the case, and the biological activity of the invention is not limited to a 
particular mechanism of action. It is thus in accordance with the present 
invention that various changes can be made in the amino acid sequence of an 
engineered PPAR or PPAR LBD mutant polypeptide of the present invention 
or its underlying nucleic acid sequence without appreciable loss of biological 

30 utility or activity. 

Biologically equivalent polypeptides, as used herein, are polypeptides 
in which certain, but not most or all, of the amino acids can be substituted. 
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Thus, when referring to the sequence examples presented in SEQ ID NOs:1 
and 3, applicants envision substitution of codons that encode biologically 
equivalent amino acids, as described herein, into the sequence example of 
SEQ ID NOs:2 and 4, respectively. Thus, applicants are in possession of 
5 amino acid and nucleic acids sequences which include such substitutions but 
which are not set forth herein in their entirety for convenience. 

Alternatively, functionally equivalent proteins or peptides can be 
created via the application of recombinant DNA technology, in which changes 
in the protein structure can be engineered, based on considerations of the 

1 0 properties of the amino acids being exchanged, e.g. substitution of He for Leu. 
Changes designed by man can be introduced through the application of site- 
directed mutagenesis techniques, e.g., to introduce improvements to the 
antigenicity of the protein or to test an engineered PPAR or PPAR LBD 
mutant polypeptide of the present invention in order to modulate lipid-binding 

1 5 or other activity, at the molecular level. 

Amino acid substitutions, such as those which might be employed in 
modifying an engineered PPAR or PPAR LBD mutant polypeptide of the 
present invention are generally, but not necessarily, based on the relative 
similarity of the amino acid side-chain substituents, for example, their 

20 hydrophobicity, hydrophilicity, charge, size, and the like. An analysis of the . 
size, shape and type of the amino acid side-chain substituents reveals that 
arginine, lysine and histidine are all positively charged residues; that alanine, 
glycine and serine are all of similar size; and that phenylalanine, tryptophan 
and tyrosine all have a generally similar shape. Therefore, based upon these 

25 considerations, arginine, lysine and histidine; alanine, glycine and serine; and 
phenylalanine, tryptophan and tyrosine; are defined herein as biologically 
functional equivalents. Other biologically functionally equivalent changes will 
be appreciated by those of skill in the art. It is implicit in the above discussion, 
however, that one of skill in the art can appreciate that a radical, rather than a 

30 conservative substitution is warranted in a given situation. Non-conservative 
substitutions in engineered mutant PPAR or PPAR LBD polypeptides of the 
present invention are also an aspect of the present invention. 
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In making biologically functional equivalent amino acid substitutions, 
the hydropathic index of amino acids can be considered. Each amino acid 
has been assigned a hydropathic index on the basis of their hydrophobicity 
and charge characteristics, these are: isoleucine (+ 4.5); valine (+ 4.2); 
5 leucine (+ 3.8); phenylalanine (+ 2.8); cysteine (+ 2.5); methionine (+ 1.9); 
alanine (+ 1.8); glycine (-0.4); threonine (-07); serine (-0.8); tryptophan (-0.9); 
tyrosine (-1.3); proline (-1.6); histidine (-3.2); glutamate (-3.5); glutamine (- 
3.5); aspartate (-3.5); asparagine (-3.5); lysine (-3.9); and arginine (-4.5). 

The importance of the hydropathic amino acid index in conferring 

10 interactive biological function on a protein is generally understood in the art 
( Kyte & Doolittle , (1982), J. Mol. Biol. 157: 105-132, incorporated herein by 
reference). It is known that certain amino acids can be substituted for other 
amino acids having a similar hydropathic index or score and still retain a 
similar biological activity. In making changes based upon the hydropathic 

1 5 index, the substitution of amino acids whose hydropathic indices are within ±2 
of the original value is preferred, those which are within ±1 of the original 
value are particularly preferred, and those within ±0.5 of the original value are 
even more particularly preferred. 

* 

It is also understood in the art that the substitution of like amino acids 
20 can be made effectively on the basis of hydrophilicity. U.S. Patent No. 
4,554,101, incorporated herein by reference, states that the greatest local 
average hydrophilicity of a protein, as governed by the hydrophilicity of its 
adjacent amino acids, correlates with its immunogenicity and antigenicity, i.e. 
with a biological property of the protein. It is understood that an amino acid 
25 can be substituted for another having a similar hydrophilicity value and still 
obtain a biologically equivalent protein. 

As detailed in U.S. Patent No. 4,554,101, the following hydrophilicity 
values have been assigned to amino acid residues: arginine (+ 3.0); lysine (+ 
3.0); aspartate (+ 3.0±1); glutamate (+ 3.0±1); serine (+ 0.3); asparagine (+ 
30 0.2); glutamine (+ 0.2); glycine (0); threonine (-0.4); proline (-0.5±1); alanine (- 
0.5); histidine (-0.5); cysteine (-1 .0); methionine (-1 .3); valine (-1 .5); leucine (- 
1 .8); isoleucine (-1 .8); tyrosine (-2.3); phenylalanine (-2.5); tryptophan (-3.4). 
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In making changes based upon similar hydrophilicity values, the 
substitution of amino acids whose hydrophilicity values are within ±2 of the 
original value is preferred, those which are within ±1 of the original value are 
particularly preferred, and those within ±0.5 of the original value are even 
more particularly preferred. 

While discussion has focused on functionally equivalent polypeptides 
arising from amino acid changes, it will be appreciated that these changes 
can be effected by alteration of the encoding DNA, taking into consideration 
also that the genetic code is degenerate and that two or more codons can 
code for the same amino acid. 

Thus, it will also be understood that this invention is not limited to the 
particular amino acid and nucleic acid sequences of SEQ ID NOs:1-4. 
Recombinant vectors and isolated DNA segments can therefore variously 
include an engineered PPARoc or PPARa LBD mutant polypeptide-encoding 
region itself, include coding regions bearing selected alterations or 
modifications in the basic coding region, or include larger polypeptides which 
nevertheless comprise an PPARa or PPARa LBD mutant polypeptide- 
encoding regions or can encode biologically functional equivalent proteins or 
polypeptides which have variant amino acid sequences. Biological activity of 
an engineered PPARa or PPARa LBD mutant polypeptide can be 
determined, for example, by lipid-binding assays known to those of skill in the 
art. 

The nucleic acid segments of the present invention, regardless of the 
length of the coding sequence itself, can be combined with other DNA 
sequences, such as promoters, enhancers, polyadenylation signals, additional 
restriction enzyme sites, multiple cloning sites, other coding segments, and 
the like, such that their overall length can vary considerably. It is therefore 
contemplated that a nucleic acid fragment of almost any length can be 
employed, with the total length preferably being limited by the ease of 
preparation and use in the intended recombinant DNA protocol. For example, 
nucleic acid fragments can be prepared which include a short stretch 
complementary to a nucleic acid sequence set forth in SEQ ID NOs:1 and 3, 
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such as about 10 nucleotides, and which are up to 10,000 or 5,000 base pairs 
in length. DNA segments with total lengths of about 4,000, 3,000, 2,000, 
1,000, 500, 200, 100, and about 50 base pairs in length are also useful. 

The DNA segments of the present invention encompass biologically 
5 functional equivalents of engineered PPAR or PPAR LBD . mutant 
polypeptides. Such sequences can rise as a consequence of codon 
redundancy and functional equivalency that are known to occur naturally 
within nucleic acid sequences and the proteins thus encoded. Alternatively, 
functionally equivalent proteins or polypeptides can be created via the 

10 application of recombinant DNA technology, in which changes in the protein 
structure can be engineered, based on considerations of the properties of the 
amino acids being exchanged. Changes can be introduced through the 
application of site-directed mutagenesis techniques, e.g., to introduce 
improvements to the antigenicity of the protein or to test variants of an 

15 engineered PPAR or PPAR LBD mutant of the present invention in order to 
examine the degree of lipid-binding activity, or other activity at the molecular 
level. Various site-directed mutagenesis techniques are known to those of 
skill in the art and can be employed in the present invention. 

The invention further encompasses fusion proteins and peptides 

20 wherein an engineered PPAR or PPAR LBD mutant coding region of the 
present invention is aligned within the same expression unit with other 
proteins or peptides having desired functions, such as for purification or 
immunodetection purposes. 

Recombinant vectors form important further aspects of the present 

25 invention. Particularly useful vectors are those in which the coding portion of 
the DNA segment is positioned under the control of a promoter. The promoter 
can be that naturally associated with a PPAR gene, as can be obtained by 
isolating the 5' non-coding sequences located upstream of the coding 
segment or exon, for example, using recombinant cloning and/or PCR 

30 technology and/or other methods known in the art, in conjunction with the 
compositions disclosed herein. 
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ln other embodiments, certain advantages will be gained by positioning 
the coding DNA segment under the control of a recombinant, or heterologous, 
promoter. As used herein, a recombinant or heterologous promoter is a 
promoter that is not normally associated with a PPAR gene in its natural 
5 environment. Such promoters can include promoters isolated from bacterial, 
viral, eukaryotic, or mammalian cells. Naturally, it will be important to employ 
a promoter that effectively directs the expression of the DNA segment in the 
cell type chosen for expression. The use of promoter and cell type 
combinations for protein expression is generally known to those of skill in the 

10 art of molecular biology ( See, e.g. , Sambrook etal. , (1 989) Molecular Cloning: 
A Laboratory Manual , Cold Spring Harbor Laboratory, New York, specifically 
incorporated herein by reference). The promoters employed can be 
constitutive or inducible and can be used under the appropriate conditions to 
direct high level expression of the introduced DNA segment, such as is 

15 advantageous in the large-scale production of recombinant proteins or 
peptides. One preferred promoter system contemplated for use in high-level 
expression is a T7 promoter-based system. 

20 

* The Role of the Three-Dimensional Structure of the PPARoc LDB in 
Solving Additional PPAR Crystals 

Because polypeptides can crystallize in more than one crystal form, the 
structural coordinates of a PPARa LBD, or portions thereof, as provided by 
the present invention, are particularly useful in solving the structure of other 
crystal forms of PPARa and the crystalline forms of other PPARs. The 
coordinates provided in the present invention can also be used to solve the 
structure of PPAR or PPAR LBD mutants (such as those described in Section 
IX above), PPAR LDB co-complexes, or of the crystalline form of any other 
protein with significant amino acid sequence homology to any functional 
domain of PPAR. 



25 



30 
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X.A. Determining the Three-Dimensional Structure of a Polypeptide 
Using the Three-Dimensional Structure of the PPARg LBD as a 
Template in Molecular Replacement 
One method that can be employed for the purpose of solving additional 
5 PPAR crystal structures is molecular replacement. See generally , Rossmann , 
ed, (1972) The Molecular Replacement Method . Gordon & Breach, New York. 
In the molecular replacement method, the unknown crystal structure, whether 

■ 

it is another crystal form of a PPARa or a PPARa LBD, (i.e. a PPARa or a 
PPARa LBD mutant), or a PPARa or a PPARa LBD polypeptide complexed 

10 with another compound (a "co-complex"), or the crystal of some other protein 
with significant amino acid sequence homology to any functional region of the 
PPARa LBD, can be determined using the PPARa LBD structure coordinates 
provided in Table 2. This method provides an accurate structural form for the 
unknown crystal more quickly and efficiently than attempting to determine 

1 5 such information ab initio. 

In addition, in accordance with this invention, PPARa or PPARa LBD 
mutants can be crystallized in complex with known modulators. The crystal 
structures of a series of such complexes can then be solved by molecular 
replacement and compared with that of wild-type PPARa or the wild-type 
20 PPARa LBD. Potential sites for modification within the various binding sites 
of the enzyme can thus be identified. This information provides an additional 
tool for determining the most efficient binding interactions, for example, 
increased hydrophobic interactions, between the PPARa LBD and a chemical 
entity or compound. 

25 All of the complexes referred to in the present disclosure can be 

studied using X-ray diffraction techniques (See, e.g. , Blundell & Johnson 
(1985) Method.EnzymoL, 114A & 115B, (Wyckoff et ah , eds.), Academic 
Press) and can be refined using computer software, such as the X-PLOR™ 
program (Brunger , (1992) X-PLOR, Version 3.1. A System for X-ray 

30 Crystallography and NMR, Yale University Press, New Haven, Connecticut; 
X-PLOR is available from Molecular Simulations, Inc., San Diego, California). 
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This information can thus be used to optimize known classes of PPAR and 
PPAR LBD modulators, and more importantly, to design and synthesize novel 
classes of PPAR and PPAR LBD modulators. 

5 Laboratory Examples 

The following Laboratory Examples have been included to illustrate 
preferred modes of the invention. Certain aspects of the following Laboratory 
Examples are described in terms of techniques and procedures found or 
contemplated by the present inventors to work well in the practice of the 

10 invention. These Laboratory Examples are exemplified through the use of 
standard laboratory practices of the inventors. In light of the present 
disclosure and the general level of skill in the art, those of skill will appreciate 
that the following Laboratory Examples are intended to be exemplary only and 
that numerous changes, modifications and alterations can be employed 

1 5 without departing from the spirit and scope of the invention. 



Laboratory Example 1 
Protein Preparation 
The nucleic acid sequence encoding the PPARot ligand binding domain 
20 (amino acids 192-468) tagged with MKKGHHHHHHG (SEQ ID NO: 9) was 
operatively linked to and expressed using the T7 promoter of plasmid vector 
pRSETA. BL21(DE3) E. coli cells transformed with this expression vector 
were grown at 24°C in shaker flasks for 66 hours on 2xYT medium (16 g/L 
Bacto-Tryptone, 10 g/L yeast extract, 5 g/L NaCI, QC with distilled water) with 
25 50 mg/L carbenicillin to an OD 6 oo of approximately 9.0. The cells were 
harvested, resuspended with 20 ml extract Buffer (20 mM HEPES, pH 7.5, 50 
mM imidozle, 250 mM NaCI and a pinch of lysozyme) per liter of cells and 
were lysed by sonication for 20 minutes on ice. The lysed cells were 
centrifuged at 40,000g for 40 minutes and the supernatant was loaded on a 
30 100 ml Ni-agarose column. 

The column was washed with 150 ml Buffer A (10% glycerol, 20 mM 
HEPES pH 7.5, 25 mM imidizole) and the protein was eluted with a 450 ml 
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gradient of Buffer B (10% glycerol, 20 mM HEPES pH 7.5, 500 mM imidozle). 
The protein, which eluted at 20% Buffer B, was diluted with one volume of 
Buffer C (20 mM HEPES, pH 7.5, 1 mM EDTA), and loaded on an 100 ml S- 
Sepharose™ (Pharmacia, Peapack, New Jersey) column. The column was 
5 washed with a 1 00 ml Buffer C and the PPARa LBD protein was eluted with a 
200 ml gradient of Buffer D (20 mM HEPES, pH 7.5, 10 mM DTT, 1 M 
ammonium acetate). The PPAR alpha LBD eluted from the column at 43% 
Buffer D. The protein yield was 9 mg/ L of cells grown and was >95% pure, 
as determined by SDS-PAGE analysis. 

1 0 The protein was then diluted to 1 mg/ml with Buffer C such that the 

final buffer composition was 220 mM ammonium acetate, 20 mM HEPES pH 
7.5, 1 mM EDTA and 1 mM DTT. The diluted protein was aliquoted into 9 ml 
aliquots, flash frozen with liquid nitrogen and stored at -80 °C. To prepare 
complexes an individual aliquot was thawed for each compound. The peptide 

1 5 SRC1 (See, Xu et al. , (1 999) Mol. Cell 3: 397-403) was added in a mol ratio of 
1 .5 as a 2mg/ 100 al DMSO stock. The peptide was then added in a mol ratio 
of 5:1 as a 2mg/100 uJ DMSO stock and spun at 4K for 20 min to clarify the 
solution before concentrating in Centriprep™ 30 filtration units (Millipore, 
Bedford, Massachusetts). The solution containing the PPARa LBD-SRC1 

20 complexes was concentrated to approximately 10 mg/ml with 80% yield. The 
complexes were then aliquoted in single use aliquots of 30 ul, flash frozen in 
liquid nitrogen and stored at -80°C. 

Laboratory Example 2 
25 Crystallization and Data Collection 

The crystals disclosed in this invention were grown at room 
temperature using the hanging drop vapor diffusion method. The hanging 
drops comprised 1 ^l of the above protein-ligand solutions, and were mixed 
with an equal volume (1 ul) of well buffer comprising 7% PEG 3350, 200 mM 
30 NaF, and 1 2% 2,5 hexanediol. 
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Before data collection, crystals were transiently mixed with well buffer 
that contained an additional 10% hexanediol as a cryoprotectant, and then 
flash frozen in liquid nitrogen. 

The PPARa crystals formed in the P2i2i2 space group, with a=61.3 A, 
5 b = 103.5A, c = 49. 9A. Each asymmetry unit contained a single PPARa LBD 
with 45% solvent content. Crystals contained Compound 1, the PPARa LBD 
and SRC1 peptide, in a ratio of 1:1:1. Data were collected with a Rigaku R- 
Axis II (Rigaku, Tokyo, Japan) detector in house, or with a MAR CCD detector 
in the IMCA 17ID beam line at the Argonne National Laboratory (Argonne, 
10 Illinois), and the observed reflections were reduced, merged and scaled with 
DENZO™ and SCALEPACK™ in the HKL2000 package ( Otwinowski , (1993) 
in Proceedings of the CCP4 Study Weekend: Data Collection and Processing . 
(Sawyer eta}., eds.), pp.56-62, SERC Daresbury Laboratory, England). 

15 Laboratory Example 3 

Structure Determination and Refinement 
The structure was determined by molecular replacement methods with 
the CCP4 AmoRe program (Collaborative Computational Project Number 4, 
1994; Navaza, (1994) Acta. Cryst A50: 157-163) using the structure 

20 coordinates for the PPAR8 LBD (Xu et al. , (1999) Mol. Cell 3: 397-403), 
residues 167-441, as the initial model (Table 3). The best fitting solution gave 
a correlation coefficiency of 70% and an R-factor of 33%. Model building was 
performed with the software program QUANTA™, and structure refinement 
was achieved using the CNS software program ( Brunger et al. , (1998) Acta. 

25 Crystallogr. D54: 905-921). Structure refinement involved multiple cycles of 
manual rebuilding. The final structure (Table 2 and Figures 1 , 2, 4 and 7) 
includes one PPARa LBD, one SRC1 peptide and Compound 1. The 
statistics of the structures are summarized in Table 1. 



WO 02/064632 



PCT/US02/03278 



-81- 

Laboratory Example 4 
Computational Analysis 
Surface areas were calculated using both the Connolly MS program 
(Connolly , (1983) Science 221: 709-713) and the MVP program ( Lambert , 
5 (1997) in Practical Application of Computer-Aided Drug Design , (Charifson, 
ed.), pp. 243-303, Marcel-Dekker, New York). The C2-symmetry axis, 
sequence alignments and binding site accessible waters were calculated 
using the software program MVP. 
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TABLE 1 

PARAMETERS FOR THE CRYSTALLINE FORM OF THE LIGAND BINDING 
DOMAIN (RESIDUES 207-441 ) OF PPARct IN COMPLEX WITH 

COMPOUND 1 



X-ray Source 


Glaxo Facility 


I MCA 


Space Group 


P2!2i2 


P2 1 2 1 2 


Resolution (A) 


20.0- 2.4 


20.0-1 .8 


Unique Reflections (N) 


64,772 


30,147 


Completeness (%) 


95.2 


99.4 


l/g(last shell) 


25.4 (3.3) 


39.5 (5.2) 


1 x sym \ fo j 


Q A 


Hr.O 


Refinement Statistics 






. R factor* (%)(2o) 




22.9 


R free 0 (%)(2ct) 




26.9 


R.M.S.D. Bond 






Lengths (A) 




0.010 


R.M.S.D. Bond 






Angles (degrees) 




1.357 


Number of H 2 0 Molecules 




396 


Total Non-hydrogen Atoms 




2659 



r.m.s.d. is the root mean square deviation from ideal geometry. 



Rsym = Z I Uvg - I/ | / Z '/ 

Rfactor = Z ! Fp - Fpcalc | / Z Fp . where F p and F pca ic are observed and 
10 calculated structure factors, respectively 

C R free is calculated from a randomly chosen 10% of reflections that have 
never been used in refinement, and R factor is calculated for the remaining 90% 
of reflections. 
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TABLE 2 

ATOMIC STRUCTURE COORDINATE DATA OBTAINED FROM X-RAY 
DIFRACTTON FROM THE LIGAND BINDING DOMAIN OF PPARa in 

COMPLEX WITH COUPOUND 1 



ATOM 


TVPT7 


T?Tn CTTYFTC 
XVE/OJJJUl} 


rKUlblJN # 


# 


X 


Y 


z 


occ 


B 


1 


PR 


A CT> 


A 


202 


14.533 


5.575 


19.857 


1.00 


78.46 




prt 




A 

A 


202 


14.270 


4.099 


20.079 


1.00 


79.53 


-J 
O 


VJJLM 


Aor 


A 

A 


r> /-\ /-% 
202 


13.370 


3.547 


19.410 


1.00 


80.23 


A 




Aor 


A 

A 


202 


14.963 


3.492 


20.921 


1.00 


80.14 


J 




A CD 

Aor 


A 


202 


15.243 


7.375 


18.274 


1.00 


76.33 


0 


\j 


A CD 

Aor 


A 

A 


202 


14.377 


8.174 


17.918 


1.00 


76.50 


7 




A CD 

Aor 


A 

A 


202 


16.101 


5.066 


18.013 


1.00 


77.36 


0 

o 


p a 


A CD 

Aor 


A 

A 


202 


14.925 


5.890 


18.412 


1.00 


77.08 


y 


XT 
IN 


T T7TT 


A 

A 


O. AO 

203 


16.488 


7.739 


18.565 


1.00 


74.80 


1 n 


PA 


T TJTT 

JLbU 


A 

A 


203 


1 zr at~» 

16.927 


9.125 


■4 ft A J*/\ 

18.460 


1.00 


72.96 


1 1 


PT* 




A 

A 


OAO 

203 


18.150 


a no 

9.378 


1 A O A A 

19.344 


1.00 


73.58 






T T7TT 


A 

A 


203 


1 O O 1 

18.821 


1 A *7/1 A 

10.740 


19.132 


1 AA 

1.00 


73.64 


1 3 




T T7TT 


A 
A 


203 


1 O A/C1 

18.061 


11 O 1 c 

11.815 


1 A OAC 

19.895 


1 AA 

1.00 


no a A 

73.44 


14 




L,Et\J 


A 




2U.ZCO 


1 C\ £Q(\ 

IU.00U 


i o cm 


1 AA 

1.U0 


73.65 


15 


P 




A 




1 7 OA/I 


• 0 At 8 


17 A1Q 


i.UU 


71 CC 


16 


o 


LEU 


A 


203 


17 251 


10 561 


16 574 
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4jj 


1 A C\£ZS 


SA CO/1 

j4.jz4 


A C 1 O 
-U.J 1 J 


1 AA 
l.UU 


CA K 
jU.jO 


1 QOA 

lyyu 


XT 

JN 


ATA 

ALA 


A 

A 


/1C/I 

4j4 


1 H *70>C 

lo./oo 


CO C07 


A 0/C7 
U.JO/ 


1 AA 
l.UU 


AQ 10 
47.JZ 


1 QQ1 


L,A 


AT A 
ALA 


a" 

A 


/l C/l 

4j4 


lo.uoy 


CO 180 
jZ.JoZ 


A 877 


1 AA 
l.UU 


A7 ^0 
4 / .07 


1 QQO 
177Z 


PR 


AT A 
ALA 


A 
A 


/I C/1 

4j4 


lO.l 17 


C 1 O/t 1 
j 1.Z41 


A 1 AO 

-U.1UZ 


1 AA 
l.UU 


A8 ^< 
45.00 


1 001 

177J 


V-/ 


AT A 
ALA 


A 
A 


4j4 


lo.ooo 


CI Q88 

J 1.700 


O OOO 


1 AA 
l.UU 


A& AQ 
40.47 


1004 
i yyn 


o 


AT A 
AJLA 


A 
A 


ASA 
HDH 


17 CC<k 


CI QA(\ 


O 171 
Z.J / J 


1 AA 
l.UU 


Al R7 


100S 
i 77 j 


"NT 

IN 


AT A 

AJLA 


A 
A 


acc 

43 J 


1 ^ R07 


C1 7A7 
D 1« /U / 


1 1QC 


1 AA 
l.UU 


41 RO 

4J.07 


1006 


PA 


AT A 


I A 
A 


ASS 
HDD 


1 £ OCA 


C1 

j 1 .04U 


A Cll 
4.J J J 


1 AA 
l.UU 


41 00 
41 .ZZ 


1997 


PR 


AT A 


A 
A 


ASS 
HDD 


1 D. jUO 


C1 Q44 
J 1 .?HH 


C C£R 


1 AA 
1 .UU 


41 A1 
41 .Ul 


1998 

j l 7 7 o 


P 


AT A 


A 


ASS 
HDD 


10.j*t 1 


AQ ROO 
4>.oZ7 


4 704 
4. / Z4 


1 AA 
l.UU 


10 R0 

J7.0Z 


1999 

l. S S J 


o 

V7 


AT A 


A 


HDD 


1 1 7 flfiO 


i 40 146 
47. 140 


i oon 

J.77V 


1 AA 

l.UU 


41 CO 
41.JZ 


2000 


N 


LFU 


A 


ASfi 


1 J.UZJ 


40 190 

*t7.JZ7 


* 77R 
J. / zo 


1 AA 


16 01 

jO.u 1 


2001 


CA 


LEU 


A 


456 


15 580 


47 904 


6 063 


1 00 


32 30 


2002 


CB 


LEU 


A 


456 


16.744 


47.508 


6.981 


1.00 


33.09 


2003 


CG 


LEU 


A 


456 


18.083 


47.052 


6.390 

* 


1.00 


33.05 


2004 


CD1 


LEU 


A 


456 


18.977 


46.576 


7.525 


1.00 


32.55 


2005 


CD2 


LEU 


A 


456 


17.870 


45.925 


5.391 


1.00 


31.55 


2006 


C 


LEU 


A 


456 


14.272 


47.560 


6.769 


1.00 


29.76 
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2007 


0 


LEU 


A 


456 




48 545 


7 574 


1 00 

X . uu 


26 72 

X.VJ. / A* 


2008 


N 


HIS 


A 


457 


11 745 

1 J. ' ^./ 


46 ^78 


6 465 


1 00 


96 87 


2009 


CA 


HIS 


A 


457 




45 897 

tJ.07 / 


7 061 


1 00 


25 80 

x>J.Ov 


2010 


CB 


HIS 


A 


457 


12 111 


44 558 


6 421 


1 00 


24 71 


2011 


CG 


HTS 


A 


457 


10 R46 

1 v.O*tV/ 


41 96R 

*TJ .7UO 


6 961 


1 00 


25 2R 

xv J 


2012 


CD2 


HTS 


A 

A 


457 

*+•/ / 


10 567 


41 401 


R 160 


1 00 

x .V/V/ 


24 58 


2013 


ND1 


HIS 

XXXWJ 


A 


457 


0 6R0 


41 901 


6 999 


1 00 


96 59 


2014 


CE1 


HIS 

XXX LJ 


A 


457 


8 740 

O. / tv 


41 116 


6 950 


1 00 


25 66 

.V/V/ 


2015 


NE2 

X> X^X» 


HTS 


A 


457 
*t j / 


Q ?<1 


41 001 


R 197 


1 00 


26 42 

xv VJ ."xv 


2016 


c 


HTS 


A 


457 


19 711 

1 xV. / i I 


45 716 


R 567 


1 00 


25 90 

A.«/ .✓V 


2017 


o 


HTS 

IxJLO 


A 

A. 


457 


1 1 794 


45 111 


9 019 


1 00 


25 20 

xv J .xv v 


2018 

AfV J O 


N 


PPO 


A 


45 R 


1 1 £69 


45 995 


9 16R 


1 00 

X .V/V/ 


25 60 


2019 




ppn 


A 

n 


45 R 


10 16R 

1 v.JOO 


46 541 


R 941 


1 00 


26 OR 

xvV/.V/O 


2020 




ppn 

ri\u 


A 


45R 


11719 
11./ iy 


45 R6R 


10 R99 


1 00 


25 49 

xv J .'-rxv 


9071 

xv V/xV 1 




ppn 


A 


45R 


10 957 


46 019 


1 1 999 
1 1 ,z,z.y 


1 00 


95 91 
£.j*y i 




V>VJ 


PPO 


A 


4SR 
*r JO 


0 777 


47 041 


10 941 


1 00 


97 1R 

xv / .-JO 


2021 


P 


PRO 


A 


45R 


19 114 

lZr. 0 1*T 


44 557 


1 1 149 


1 00 

I .\J\J 


95 07 


9074 

XVVZ»T 




PPO 


A 


45R 


11151 

1 J. 1«7 1 


44 559 
h*+. d jy 


19 946 


1 00 

1 .\J\J 


95 99 

XV J. XV 7 


2025 

XVUxV*/ 




T PTT 


A 


459 


1 1 R7R 

1 1.0 / o 


41 416 


10 775 


1 00 

1 .V/V/ 


91 71 

XV J . / 1 


2026 

X* V/ X* V 


CA 


LEU 


A 

A 


459 


19 


49 141 

* 


1 1 914 


1 00 

J. • w 


22 76 

XV A. • / V 


2027 


CB 


LEU 


A 

A 


459 


1 1 618 


41 002 


10 510 


1 00 

X . vv 


22 31 

xVxV.«J X 


2028 


CG 


LEU 


A 


459 


19 098 


! 19 582 


10 86R 


1 00 

X »vv 


22 34 


2029 


CD1 


LEU 


A 

X X. 


459 


11 955 


39 318 


12 368 


1 00 

X . V V 


23 76 

Xi J . / V 


2030 


CD2 


LEU 


A 


459 


11 284 

X X 


38 567 


10 076 


1 00 

X »w 


22 76 

Xi<A«i / V/ 


2031 


c 


LEU 


A 


459 


13 872 


41 998 


10 930 


1 00 

X #w 


22 18 


2032 


o 


LEU 


A 


459 


14 623 


41 493 


11 761 

X X • / \J X 


1 00 

X . w 


21 97 


2033 


N 


LEU 


A 


460 


14.301 


42 438 


9 753 


1 00 

X • w w 


21 66 

X • WW 


2034 


CA 


LEU 


A 


460 


15.710 


42.337 


9.397 


1.00 


21.60 


2035 


CB 


LEU 


A 


460 


15.892 


42.619 


7.902 


1.00 


20.70 


2036 


CG 


LEU 


A 


460 


14.974 


41.760 


7.022 


1.00 


19.18 


2037 


CD I 


LEU 


A 


460 


15.343 


41.949 


5.560 


1.00 


20.00 


2038 


CD2 


LEU 


A 


460 


15.098 


40.285 


7.418 


1.00 


16.85 


2039 


C 


LEU 


A 


460 


16.539 


43.305 


10.232 


1.00 


23.25 



WO 02/064632 



PC1YUS02/03278 



148 



2040 


o 


LEU 


A 
i x 


460 


17 679 


43 015 


10 606 


1 00 


21 71 


2041 


N 

m, ^ 


GLN 


A 


461 


15 952 


44 453 


10 541 


1 00 


23 59 


2042 


CA 


GLN 


A 


461 


16 645 


45 444 


11 339 


1 00 


26 43 


2043 


CB 


GLN 


A 


461 


15 808 


46 720 


11 444 

1 JL . » » » , 


1 00 


28.61 


2044 


CG 


GIN 


A 


461 


16 491 


47 837 


12 219 


1 00 

X t w 


34 25 


2045 


CD 


GT N 


A 
n 


461 


17 776 

11*1 1 XJ 


48 316 


1 1 564 


1 00 


36 90 


2046 


OE1 


GTN 


A 


461 


18 513 


49 110 


12 145 


1 00 

X «w 


40 50 


2047 


NE2 


GLN 


A 


461 


18 045 


47 843 


10 353 


1 00 


38.17 


2048 


c 


GT N 


A 


461 


16 950 


44 899 

^fr .077 


12 734 


1 00 


25 75 


2049 


o 




A 

A 


461 


18 057 


45 078 


13 235 


1 00 

X . V w 


27.57 


2050 


N 


GT IT 


A 


462 


15 989 


44 224 


13 361 


1 00 


25.39 


2051 


PA 


GT TT 


A 


462 


16 241 


43 696 


14 699 


1 00 


26.81 






GT TT 


A 
A 


46? 


14 930 


43 290 


15 401 


1 00 


28 93 


2053 


PG 


GT TT 


A 


462 


14 134 

1*T. 1J*T 


42 155 


14 784 


1 00 

X . w 


30 21 


9054 


PD 


GT TT 


A 


462 


12 838 


41 880 


15 553 


1 00 


32 16 


2055 




GT TT 


A 
A 


462 


11 991 


42 795 


15 644 


1 00 


32 22 


2056 


OE2 


GT TT 


A 




12 665 


40 755 


16 071 


1 00 

X . w 


30 73 


2057 


c 


GT TT 


A 

A 


462 


17 240 


42 541 


14 688 

X~. VIU VJ 


1 00 

X . w 


25 84 


2058 


o 


GT TT 


A 

A 


462 


17 970 


42 332 


15 659 


1 00 


25 91 


2059 


N 


TLE 


A 

A 


463 


17 291 


41 797 


13 589 


1 00 


23 23 


2060 


CA 


TLB 

X X-IX.J 


A 


463 


18 239 


40 699 


13 497 


1 00 

X . w V 


22 62 

m*m+ « \/X* 


2061 


CB 


TLE 


A 

x x 


463 


17 942 


39 793 


12 273 


1 00 


23 65 


2062 


CG2 


ILE 


A 


463 


19 115 

J. ^ • X X 


38 833 


12 020 


1.00 


21.96 


2063 


CGI 


ILE 


A 


463 


16.650 


39.007 


12.522 


1.00 


22.25 

m+mm ■ mm%0 


2064 


CD1 


ILE 


A 


463 


16.196 


38.160 


11.338 


LOO 


24.20 


2065 


c 


ILE 


A 


463 


19.658 


41.263 


13.396 


1.00 


23.04 


2066 


0 


ILE 


A 


463 


20.568 


40.782 


14.062 

-*■ ■ • V WJM 


1.00 


20.73 


2067 


N 


TYR 


A 


464 


19.839 


42.298 


12.579 


1.00 


23.82 


2068 


CA 


TYR 


A 


464 


21.160 


42.899 


12.400 


1.00 


25.85 


2069 


CB 


TYR 


A 


464 


21.226 


43.636 


11.063 


1.00 


24.50 


2070 


CG 


TYR 


A 


464 


21.403 


42.697 


9.892 


1.00 


22.75 


2071 


CD1 


TYR 


A 


464 


22.563 


41.933 


9.758 


1.00 


23.07 


2072 


CE1 


TYR 


A 


464 


22.712 


41.029 


8.703 


1.00 


22.58 
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2073 


CD2 


TYR 


A 


464 


20 397 

x>\y v / 


42 538 

~XV **J ~J \J 


8 943 


1 00 


23 44 


2074 


CE2 


TYR 


A 


464 


20 537 


41 644 


1 893 


1 00 


20 95 

x*\s • ✓ «,/ 


2075 


cz 


TYR 


A 

A X 


464 


21 692 


40 894 


1 779 


1 00 

X . \J\J 


22 05 


2076 


OH 


TYR 


A 

A X 


464 


^1 819 


40 001 

^V/. 1/1/ i 


6 744 


1 00 

1 • V/ v 


21 31 


2077 


c 


TYR 


A 

XX 


464 


21 608 


43 827 


13 523 


1 00 


27 80 

x. / ,t>V 


2078 


o 


TYR 

X X XX. 


A 


464 

*TV*T 


92 803 


44 064 


13 685 


1 00 


28 04 


2079 


N 


ARG 

XXX W_J 


A 

XX 


465 


20 661 

XyV .VJV 1 


44 350 


14 294 


1 00 


29 91 


2080 


CA 


ARG 


A 

x X 


465 


21 002 


45 238 


15 403 

X »/ i*TVJ 


1 00 


33 29 


2081 


CB 


ARG 


A 

A 


465 


19 731 


45 764 

"T»/ . / V^ 


16 074 


1 00 


35 76 


2082 


CG 


ARG 


A 


465 


19 978 

A 7 ..7 / V 


46 538 


17 368 

X / .J vU 


1 00 


38 99 


2083 


CD 


ARG 


A 


465 


18 669 

1 O.W7 


47 023 


17 976 


1 00 

A ,VV 


42 46 

rX » i \J 


2084 


NE 

A ~ J— ' 


ARG 


A 

ix 


465 

*TV»/ 


18 868 


47 706 


19 252 


1 00 


44 44 


2085 

X> V w 


CZ 


ARG 


A 

XX 


465 


1 7 897 


48 297 


19 943 

1 7.7*T«/ 


1 00 

A .VV 


46 85 


2086 


NH1 


ARG 

AivvJ 


A 

A 


465 


16 652 


48 994 


19 482 


1 00 


47 83 

"T / «UJ 


2087 


NH2 


ARG 


A 

A 


465 


18 167 

1 O. IV/ / 


48 890 


21 097 

4. 1 .VI7 f 


1 00 

A »VV 


47 38 

T / .«/ O 


2088 




A1*G 


A 

A 


465 


91 858 
xvi .0.70 


44 503 


16 436 


1 00 

A .VV 


34 75 


2089 




ARG 


A 

A 


465 


21 406 


43 540 

*rJ .«/*rV 


17 055 


1 00 

A » W 


34 47 


2090 


N 


ASP 


A 

A 


466 


23 095 

x. J iU7J 


44 961 

*T*T.-7VF X 


16 608 


1 00 

A .VV 


36 62 


2091 


I CA 


ASP 


A 

A 


466 


94 096 


44 372 


17 569 


1 00 

A .VV 


38 83 


2092 

X.V/-^ A* 


CB 


ASP 


A 

XX 


466 


93 456 

X**/.*TfcJV/ 


44 470 


18 988 


1 00 

A . VV/ 


39 46 


2093 


CG 


ASP 


A 


466 


23 231 


45 902 


19 429 


1 00 

X ■ w 


40.61 


2094 


OD1 


ASP 


A 

X X 


466 


24 151 

xj~. 1 A 


46 729 


19 242 


1 00 

X ■ w 


40 51 


2095 


OD2 


ASP 


A 


466 


22 141 


46 198 


19 968 


1 00 


40.94 


2096 


C 


ASP 


A 


466 


24 407 


42 921 


17 288 


1 00 


39.96 


2097 


o 


ASP 


A 


466 


24.783 


42 187 


18.203 


1 00 


39.85 


2098 


N 


MET 


A 


467 


24.311 


42 500 


16.031 


1 00 


41.66 


2099 


CA 


MET 


A 


467 


24 674 


41 132 


15 678 


1 00 


43 69 


2100 


CB 


MET 


A 


467 


24.328 


40.841 


14.217 


1.00 


43.01 


2101 


CG 


MET 


A 


467 


24.631 


39.416 


13.797 


1.00 


43.53 


2102 


SD 


MET 


A 


467 


24.328 


39.125 


12.053 


1.00 


45.18 


2103 


CE 


MET 


A 


467 


22.544 


38.865 


12.066 


1.00 


44.23 


2104 


C 


MET 


A 


467 


26.175 


40.966 


15.889 


1.00 


45.56 


2105 


0 


MET 


A 


467 


26.619 


40.175 


16.722 


1.00 


46.26 
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2106 


N 


TYR 


A 


468 


26.950 


41.724 


15.122 


1.00 


46.99 


2107 


CA 


TYR 


A 


468 


28.403 

Art Xrf • ■ W +r 


41.691 

■ At V ^ *■ 


15.215 


1.00 


48.84 


2108 


CB 


TYR 


A 


468 


29.016 


41.118 

• A 1 A- A V^ 


13.930 


1.00 


48.71 


2109 


CG 


TYR 


A 


468 


28.731 

Aw V/ * r «J a 


39.646 


13.712 


1.00 


48.95 


2110 


CD1 


TYR 

A, A X» 


A 

x a. 


468 

I v vj- 


27 913 


39 218 


12 665 

X Aw * VJ VJ w 


1.00 


49.23 


2111 


CE1 


TYR 

Ai A» JL, 


A 

A A. 


468 

• VJ V 


27 644 

a* / • vj i * 


37 861 


12.468 


1.00 


49.07 


2112 


CD2 


TYR 


A 

a x 


468 

* V/ VJ 


29 273 

AV « JW # *v 


38 680 


14.560 

A ft «J VJ W 


1.00 


49.57 

i rt^ * w r 


2113 


CE2 


TYR 


A 

4- Jk 


468 


29.011 

AW,** » V X X 


37.324 


14.374 


1.00 


49.37 


2114 


CZ 


TYR 

Jk A A V 


A 

X A 


468 

1 vj ^J 


28.197 

Art \-/ • A. / 


36 921 

w V/ • «^ xw X 


13.328 

A «^ * «S A^ W 


1.00 


49.98 


2115 


OH 


TYR 

Jk X XX 


A 

A Jk 


468 

r v* vj 


27.940 

XV / • rtr 1 V/ 


35 580 

%J W ♦ +s VJ VJ 


13 147 

X t X 1 # 


1.00 


49.98 


2116 


c 


TYR 

X X X. 


A 

A Jk 


468 


28 928 

XV VJ • X* V 


43 101 

1 *J • A VJ Jk 


15 462 


1.00 


49.62 

T rtF • X^ Arf 


2117 

Art A A / 


o 


TYR 

• 


A 


468 


29 141 


43 829 


14 472 

X F * f / JV 


1 00 

A a XX W 


50.21 

*^ VJ »Art A 


2118 


OT 


TYR 


A 

£ X 


468 


29 091 


43 471 


16 646 

X VJ • VJ ■ VJ 


1.00 

A « X# W 


50.66 

^ x# • x* xy 


2119 

Ajrt A A ^ 


CB 


GLU 

XJ V-/ 


B 


685 

V/ S*J %J 


18 548 


43 302 

w • w V/ JW 


21 979 

J** x • ✓ / ✓ 


1.00 

a • xy^ w 


63.65 

X^ w/ • X#%< 


2120 

A* JL A* V/ 


CG 


GLU 

VJ X-J K^f 


B 


685 


18 317 


43 165 


23 475 


1.00 


63.87 


2121 


CD 


GLU 


B 


685 


18 479 


41 737 


23 956 

JA* iJ • «^ mp/ VJ 


1.00 


64.23 

Xjr « • «jr 


2122 


OE1 


GLU 


B 


685 


17 606 


40 903 


23 635 


1.00 

A • VJ V 


64.28 

VJ I » Art XJ 


2123 

Art A AV*J 


OE2 


GLU 


B 


685 


19 478 


41 445 


24 647 

xw t • vy i f 


1.00 

X » \J XJ 


64.53 


2124 

as* x xv r 


c 


GLU 


B 


685 


16 486 


44447 


21 144 

JW X « X | 


1.00 

a • xy x# 


61.94 


2125 


o 

x^ 


GLU 

x-j x_j vy 


B 


685 


15.986 


44.713 


20.050 


1.00 


62.27 


2126 


N 


GLU 


B 


685 


18.323 


45.755 


22.227 

*■? Art 4 Art » 


1.00 


62.90 

Art * X^ 


2127 


CA 


GLU 


B 

a- ^ 


685 

XJ vj 


17.990 


44.588 


21.363 


1.00 


62.73 


2128 


N 


ARG 


B 


686 


15.773 


44.020 


22.185 


1.00 


60.58 


2129 


CA 


ARG 


B 


686 


14.324 


43.847 


22.118 


1.00 


59.13 


2130 


CB 


ARG 


B 


686 


13.677 


45.103 


21.527 


1.00 


60.07 


2131 


CG 


ARG 


B 


686 


12.155 


45.100 

# 


21.495 


1.00 


61.27 


2132 


CD 


ARG 


B 


686 


11.558 


45.589 


22.807 


1.00 


62.16 


2133 


NE 


ARG 


B 


686 


10.176 


46.030 

• 


22.634 


1.00 


63.43 


2134 


CZ 


ARG 


B 


686 


9.159 


45.223 


22.350 


1.00 


64.44 


2135 


NH1 


ARG 


B 


686 


9.361 


43.920 


22.210 
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LOO 


56.74 


2611 


0 


HOH 


s 


383 


27.823 


7.313 


-8.539 


1.00 


49.75 


2612 


0 


HOH 


s 


384 


-1.210 


31.371 


1 8.699 


1.00 


53.14 


2613 


0 


HOH 


s 


385 


7.322 


35.525 


28.034 


1.00 


61.44 


2614 


0 


HOH 


s 


386 


14.217 


16.674 


-13.221 


1.00 


42.98 


2615 


0 


HOH 


s 


387 


20.078 


59.930 


-2.925 


1.00 


54.66 


2616 


0 


HOH 


s 


388 


28.936 


2.000 


15.009 


1.00 


47.04 


2617 


0 


HOH 


s 


389 


14.166 


28.692 


31.577 


1.00 


59.65 


2618 


o 


HOH 


s 


390 


20.846 


52.499 


14.026 


1.00 


61.68 


2619 


o 


HOH 


s 


391 


13.294 


5.368 


25.444 


1.00 


52.09 


2620 


0 


HOH 


s 


392 


26:874 


59.625 


-2.846 


1.00 


46.54 


2621 


o 


HOH 


s 


393 


11.393 


5.392 


16.760 


1.00 


45.29 


2622 


' o 


HOH 


s 


394 


30.608 


57.650 


11.108 


LOO 


63.95 


2623 


o 


HOH 


s 


395 


33.858 


39.183 


-13.250 


1.00 


54.28 


2624 


o 


HOH 


s 


396 


16.117 


14.396 


-7.011 


LOO 


46.14 


2625 


o 


HOH 


s 


397 


37.725 


38.263 


6.165 


1.00 


64.28 


2626 


o 


HOH 


s 


398 


6.555 


42.050 


-19.474 


LOO 


48.56 


2627 


CIA 


735 


c 


— j— 


19.341 


40.726 


3.997 


LOO 


15.91 


2628 


OIC 


735 


c 


1 


18.234 


40.214 


4.328 


LOO 


17.26 


2629 


01B 


735 


c 




20.387 


40.473 


4.647 


LOO 


17.17 


2630 


C1D 


735 


c 




19.458 


41.690 


2.760 


LOO 


14.73 


2631 


C1X 


735 


c 




19.838 


43.096 


3.263 


LOO 


16.66 


2632 


C1Y 


735 


c 




18.087 


41.870 


1.985 


LOO 


16.99 


2633 


OlE 


735 


c 




20.617 


41.213 


1.907 


LOO 


15.62 
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2634 


C1F 


735 


C 


1 


20.412 


40.092 


1.049 


1.00 


12.70 


2635 


C1G 


735 


C 


1 


20.433 


40.315 


-0.334 


1.00 


16.64 


2636 


C1I 

• 


735 


C 


1 


20.226 


39.243 


-1.204 


1.00 


15.86 


2637 


C1K 


735 


C 


1 


19.985 


37.892 


-0.696 


1.00 


15.12 


2638 


CU 


735 


c 


1 


19.969 


37.669 


0.700 


1.00 


15.25 


2639 


C1H 


735 


c 


1 


20.181 


38.761 


1.576 


1.00 


16.34 


2640 


C1L 


735 


c 


1 


19.744 


36.701 


-1.667 


1.00 


16.46 


2641 


N1M 


735 


c 


1 


19.050 


37.255 


-2.868 


1.00 


16.48 


2642 


C2A 


735 


c 


— r" 


17.700 


37.386 


-3.035 


1.00 


19.45 


2643 


02A 


735 


c 


1 


16.905 


37.016 


-2.143 


1.00 


20.40 


2644 


S2C 


735 


c 


1 


15.441 


38.082 


-4.381 


1.00 


18.50 


2645 


C2B 


735 


c 


-1—1 


17.156 


37.942 


-4.212 


1.00 


18.76 


2646 


C2D 


735 


c 


1 


17.793 


38.466 


-5.406 


1.00 


19.70 


2647 


C2G 


735 


c 


1 


19.306 


38.558 


-5.720 


1.00 


20.26 


2648 


N2E 


735 


c 


1 


16.928 


38.919 


-6.324 


1.00 


17.68 


2649 


C2F 


735 


c 


1 


15.614 


38.789 


-5.945 


1.00 


19.94 


2650 


C2H 


735 


c 


1 


14.528 


39.186 


-6.744 


1.00 


21.71 


2651 


C2J 


735 


c 


1 


13.173 


39.004 


-6.286 


1.00 


20.47 


2652 


C2L 


735 


c 


1 


12.084 


39.400 


-7.096 


1.00 


22.73 


2653 


C2M 


735 


c 


1 


12.305 


39.993 


-8.391 


1.00 


20.47 


2654 


C2K 


735 


c 


1 


13.651 


40.175 


-8.842 


1.00 


23.90 


2655 


C2I 


735 


c 




14.735 


39.782 


-8.042 


1.00 


22.13 


2656 


C2N 


735 


c 




11.115 


40.435 


-9.308 


1.00 


25.97 


2657 


F2P 


735 


c 




10.952 


39.499 


-10.296 


1.00 


31.88 


2658 


F2Q 


735 


c 




9.934 


40.532 


-8.627 


1.00 


31.88 


2659 


F20 


735 


c 




11.362 


41.644 


-9.892 


1.00 


31.88 
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TABLE 3 

ATOMIC STRUCTURE COORDINATE DATA OBTAINED FROM X-RAY 
DIFFRACTION FROM THE LIGAND BINDING DOMAIN OF PPARq USED IN 

MOLECULAR REPLACEMENT 



ATOM 


ATOM 
TYPE 


RESIDUE 


PROTEIN 

# 

r X 


# 

it 


X 


Y 

JL 


7 


ore 


R 


1 


CB 


ASP 


A 


211 

fa 1 1 


14 51 


5 574 


19 848 
x .oto 


1 00 


78 46 


2 


CG 


ASP 

A m mm mm 


A 


211 


14 259 


4 095 


90 068 


1 00 

I ,\J\J 


70 53 


3 


OD1 


ASP 


A 

A. 1 


211 


13 363 


3 535 




i no 

X .UU 


80 93 


4 


OD2 


ASP 


A 

a X 


211 


14 961 


3 492 




1 00 

X .\J\J 


80 14 

OV/. X*T 


5 


c 


ASP 

A A, *— ■r A. 


A 

a i. 


211 


15 241 

X «/ «fa~ 1 


7 375 


1 8 97? 

1 O.fa /fa 


1 00 


76 33 


6 


O 


ASP 


A 


211 

fa 1 JL 


14 371 

X ~ mmJ 1 X 


8 176 

O. X / V 


17 099 
i / .y^y 


1 00 

X .Uv 


76 50 


7 


N 


ASP 

A A*»^ A> 


A 


211 

fa X X 


16 106 


5 066 


18 029 


1 00 

X •\)\J 


77 36 


8 


CA 


ASP 


A 

* *- 


211 
fail 


14 923 


5 889 


18 409 


1 00 

X .V V 


77 08 


9 


N 


LEU 


A 


212 
fa i fa 


16 49 


7 738 


18 549 


1 00 


74 80 


10 


CA 


LEU 


A 


212 
fa i fa 


16 926 

X W • ✓ fa V 


9 125 

A fa «✓ 


18 438 


1 00 

X .\J\J 


72 96 


11 


CB 


LEU 


A 


212 

mm A m 


18 142 

JL w » JL IAtI 


9 393 


19 328 


1 00 

1 »w 


73 58 


12 


CG 


LEU 

^frfa« ^atf 


A 


212 

mm JL AW 


18 809 

jl Uiuvy 


10 76 


19 098 


1 00 

1 ■ w 


73 64 


13 


CD1 


LEU 


A 


212 


18.094 


11.83 

A A » V 


19.907 


1.00 

A « W V 


73.44 


14 


CD2 


LEU 


A 


212 


20.274 


10.68 


19.492 

A m* • I Arf 


1.00 


73.65 


15 


C 


LEU 


A 


212 


17.315 


9.398 


16.997 


1.00 


71.55 


16 


O 


LEU 


A 


212 


17.304 


10.54 


16.548 


1.00 


71.54 


17 


N 

■ 


LYS 


A 


213 


17.672 


8.346 


16.274 


1.00 


69.51 


18 


CA 


LYS 


A 


213 


18.069 


8.512 


14.89 


1.00 


67.63 


19 


CB 


LYS 


A 


213 


18.837 


7.279 


14.416 


1.00 


68.39 


20 


CG 


LYS 


A 


213 


19.477 


7.423 


13.047 


1.00 


68.94 


21 


CD 


LYS 


A 


213 


20.908 


6.9 


13.051 


1.00 


69.41 


22 


CE 


LYS 


A 


213 


21.036 


5.602 


13.837 


1.00 


69.58 


23 


NZ 


LYS 


A 


213 


22.361 


4.97 


13.652 


1.00 


70.30 


24 


C 


LYS 


A 


213 


16.845 


8.769 


14.018 


1.00 


65.91 


25 


0 


LYS 


A 


213 


16.97 


9.16 


12.856 


1.00 


66.04 


26 


N 


SER 


A 


214 


15.661 


8.558 


14.587 


1.00 


62.82 


27 


CA 


SER 


A 


214 


14.425 


8.817 


13.86 


1.00 


59.48 
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28 


PR 


9FR 


A 
A 


01 /i 
Z14 


1 1 01 a 
13.Z16 


O *)CH 

o.zo/ 


1 A CIA 

14.624 


1 AA 
1.00 


CA CC 

59.65 


29 


OG 


OJZt\. 


A 
A 


0 1 A 
Z 14 


1 0 QQ1 


A 

y 


K OK 

15.815 


1 AA 
1.00 


CA A1 

59.47 




p 




A 
A 


01/1 
Z14 


14.3Z6 


1 A 1 A 

10.34 


1 1 nc 

13.76 


1 AA 
1.00 


CC oo 

56.82 


J JL 


n 




A 
A 


Z14 


1 1 CA1 

13.501 


1 A 0*7 

10.87 


1 1 A1 A 

13.019 


1 AA 

1.00 


CC 

56.27 






t Frr 

LUU 


A 

A 


Z15 


1 C 1 oo 

15.183 


1 1 A1 

11.01 


1 A CO /I 

14.524 


1 A A 

1.00 


53.62 


33 




T FTT 
LUU 


A 

A 


11 C 


1 c 1/1 A 

15.244 


12.47 


14.553 


1 AA 

1.00 


CA 1C 

50.75 






r FTT 


A 
A 


o 1 c 

Zl 5 


1 C ICO 

16.368 


1 1 A1 

12.93 


1 C VIA 

15.49 


1 AA 

1.00 


50.59 


35 




T FTT 


A 
A 


1 1 C 

Z15 


1 c nic 
16. 736 


1 A Af\ 

14.42 


1 C C AO 

15.548 


1 A A 

1.00 


A t\ f\ A 

49.94 


36 


pm 


T FTT 
LUU 


A 
A 


0 1 < 

Z15 


1 C CO 

15.58 


15.23 


16.111 


1 AA 

1.00 


A A 1 O 

49.18 


37 




T FTT 
LUU 


A 
A 


01 < 
Z ID 


1 1 QIC 

i /.y /5 


14.6 


16.414 


1 AA 

1.00 


a a a 

49.63 


38 

JO 


p 


T FTT 
LXIU 


A 
A 


O 1 C 


1 C /IQ/C 

15.496 


13.02 


111 cc 

13.155 


1 A A 

1.00 


A O AO 

48.97 


30 
jy 


l/ 


T FTT 
LtlU 


A 

A 


215 


1 c Ain 

15.029 


1 A 1 

14.1 


1 O OAA 

12.809 


1 AA 

1.00 


A A 1 1 

49. II 


40 


1M 


AT A 
ALA 


A 
A 


O 1 C 

ZlO 


i c oi o 
lo.ZJo 


12.26 


11 iff 

12.355 


1 AA 

1.00 


AC AH 

46.47 


41 


PA 


AT A 

ALA 


A 

A j 


0 1 /£ 

ZlO 


1 C CCQ 

16.553 


lz.66 

« 


1 A AAvl 

10.994 


1 AA 

l.OO 


A C AO 

45.03 


4? 

HZ 


PR 


AT A 
ALA 


A 

A 


Zlo 


1 1 yiOO 

l /.4z3 


1 1 CI 

11.61 


1 A 1T> 

10.322 


1 A A 

1.00 


A A OO 

44.87 


4 J 




AT A 

ALA 


A 

A 


1 1 /T 

216 


15.288 


12.9 


10.169 


1 A A 

l.OO 


A A CO 

44.53 


44 




AT A 
ALA 


A 

A 


1 1 £ 

Zlo 




1 A 

14 


A CO 

9.662 


1 AA 

l.OO 


A O jIO 

43.47 


45 
4J 


XT 
IN 


T VC 


A 

A 


1 1 *7 
Zl / 


1 A A AC 

14.440 


1 1 DO 

11.88 


1 A A/I 1 

10.041 


1 AA 

1.00 


43.76 


46 


v^A 


T VO 


A 

A 


III 


13,219 


12.02 


A O/T/l 

9.264 


1 AA 

1.00 


A o OA 

42.20 


47 
4/ 


PR 


T VC 


A 

A 


1 1 1 
III 


lz.4oo 


1 A Cf\ 

10.69 


A 1A1 

9.203 


1 AA 

LOO 


44.47 


48 


pn 


T VQ 


A 
A 


Zl / 


1 1 

1 1.j3 / 


1 A CQ 


O OOQ 


1 AA 
1.00 


/!C A1 

43.43 


40 


pr> 


T VQ 


A 
A 


117 
Zl / 


1U. /U3 


y.314 


7 OO/l 


1 AA 
1.0U 


AC, AO 




PF 


T VQ 
L X o 


A 
A 


01 7 


y.03y 


Q 1<ZA 

9.3D4 


O AO 

y.oo 


1 AA 
1.00 


/1Q AO 


51 


T\T7 


T 

L X O 


A 
A 


017 
Zl / 


C 771 
o. / / 1 


1/17 
o.Ih/ 


0 /143 

y.u4o 


1 AA 


40 04 


5? 


P 


T V<s 

L I O 


A 
A 


017 
Zl / 


10 ^05 
IZ.jZD 


i ^ no 


0 8^ 
y.oOO 


1 AA 
1 .\J\) 


4.1 10 
HI . Ly 


53 


o 




A 


917 
Z 1 / 


1 1 6^3 
1 1 .0 J J 


1 3 83 

1 J .OJ 


0 145 


i on 


dO 18 

10 


54 


N 


APfr 


A 


918 

Z J. 0 


19 30^ 


13 18 
10.10 


1 1 103 
ll.l >0 


1 no 

1 ,vu 


38 04 


55 


CA 




A 
r\ 


718 

ZIO 


1 1 534 


14 1 8 

1*T. 10 


1 1 001 

1 1 .71/1 


1 no 

1 .1/1/ 


37 9a" 


56 


CB 


ARG 


A 


218 


11 748 


14 03 


13 411 


1 00 


39 88 


57 


CG 


ARG 


A 


218 


11.396 


15.26 


14.24 


1.00 


43.03 


58 


CD 


ARG 


A 


218 


9.903 


15.54 


14.26 


1.00 


45.71 


59 


NE 


ARG 


A 


218 


9.619 


16.9 


14.734 


1.00 


46.98 


60 


CZ 


ARG 


A 


218 


8.399 


17.41 


14.851 


1.00 


47.56 


61 


NH1 


ARG 


A 


218 


7.335 


16.68 


14.531 


1.00 


48.63 
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62 


MHO 


ARG 


A 

A 


71 Q 
ZI5 


0 O/IO 

B.Z4Z 


1 0 ac 


1 c OO 

15.28 


1 AA 

1.00 


A1 AO 

47.03 


63 


p 


ATCG 


A 
A 


71 Q 
Zlo 


1 1 Q/1Q 


1 C CO 
1 J.JO 


11 AAA 

1 1.444 


1 AA 

1.00 


1 a 0*7 

34.87 


64 


o 


ARG 


A 
A 


71 R 


111 no 

i i.iuy 


1D.39 


I 1 ACO 

I I .U5Z 


1 AA 
l.UU 


33.63 


65 


N 


TT P 


A 


710 

ziy 


1 1 1/1*7 

13.Z4/ 


1 C Q/i 
1 J. 50 


1 1.5 


1 AA 

1.0U 


31.60 


66 


PA 


TT P 


A 
A 


71 Q 

ziy 


13. /oZ 


1 /.lo 


1 1 f\OA 

11.U84 


1 AA 

1.00 


IO vl C 

28.45 


67 




TT P 


A 
A 


7 1 Q 

ziy | 


i r oqc 

Ij.ZoO 


1 H 1*7 
1 /.27 


11.356 


1 AA 

1.00 


27.35 


68 


PG9 


TT P 


A 


01 O 

ziy 


1 C Q.HA 

15.8/4 


1 O AH 

lo.4o 


1 A 

1U.625 


1 AA 

1.00 


26.86 




pgi 


TT P 


A 


ziy 


15.53 


1 h in 

17.39 


12.863 


1 AA 

1.00 


n *> 1 

2132 


70 
/ U 


PFi1 


TT T? 


A 

A 


o 1 a 

zi9 


16.997 


17.3 


1 <*> CO 

13.258 


LOO 


26.22 


71 


n 


TT P 


A 

A 


zi9 


13.486 


17.35 


A CAT 

9.597 


1 A A 

l.OO 


27.44 


79 


w 


TT P 


A 

A 


71 Q 

Ziy 


111 ci 
13.153 


i O /If 

10.45 


A 1 CI 

9.153 


1 AA 

1.00 


26.71 


71 


A] 
IN 


1 x K 


A 

A 


ooa 
ZZU 


13.619 


16.28 


O O A 

8.834 


1 A A 

LOO 


25.68 


74 


PA 


TVD 
1 YK 


A 
A 


ZZU 


1 1 11 A 

13.3 /4 


15.33 


7.4U2 


1 AA 

LOO 


26.73 


7S 


PR 


TVD 


A 

A 


OOA 


1 1 OO/C 

13. /oo 


1 C A1 

15.U1 


c n a 0 

6.748 


1 AA 

1.00 


25.17 


If. 


pg 


TVD 

1 YK 


A 

A 


ooa 
ZZU 


13.663 


14.97 


5.236 


1 AA 

LOO 


0*7 AO 

27.92 


77 


V-JU1 


TVD 


A 

A 


zzU 


1 1 oca 

13.556 


16.12 


4.465 


1 A A 

LOO 


OT 1 A 

27.10 


7R 
/ o 




1 IK 


A 

A 


ZZU 


13./ /9 


16.U7 


3.074 


1 A A 

LOO 


OO /I A 

27.40 


7Q 

iy 




TVD 
1 YK 


A 

A 


OOA 

ZZU 


1 1 QA1 

13.391 


13.77 


4.57Z 


1 AA 

LOU 


OO OA 

Z8.ZU 


RO 


PP7 


1 YK 


A 

A 


Tin 
ZZU 


1 1 1 1 £. 

13.316 


13.72 


3.18 


1 AA 

LOO 


OA Cf\ 

29.6U 


R1 




TVD 
1 YK 


A 

A 


ZZU 


13.J1Z 


1 /i 00 
14.00 


Z.438 


1 AA 
l.UU 


OO HI 
Z8.6/ 


R9 




TVD 
1 YK 


A 
A 


77A 
ZZU 


1 1 /id 
13.4D3 


1 A 0*2 

14.63 


1 A#C7 
1.U0Z 


1 AA 
l.UU 


7A 77 

3U.ZZ 


R3 


v., 


TVD 
1 IK 


A 
A 


77ft 
ZZU 


1 1 QO< 
1 l.O^J 


10.04 


7 1 CC 
/.l JJ 


1 AA 
l.UU 


7/i 1< 


R4 


n 


TVD 
1 IK 


A 
A 


77fk 
ZZU 


1 l.J D4 


1 7 ill 
1 / .41 


O.ZOJ 


1 ftfi 

1 .uu 


77 04 


O J 


M 

IN 


GT TT 


A 
A 


771 
ZZ1 


ii nn 

1 l.Ul D 


1 £ OA 


7 OAR 


1 nn 
1 .uu 


77 41 


86 


PA 


GT TT 


A 
A 


991 


7.J07 


10. zy 


7 77^ 


1 no 


97 IS 


87 


PR 


GT TT 


A 

A 


991 


o. /O 


1 ^ ^0 


R fi04 


1 Oft 


90 ^7 




PG 


GT TT 


A 
A 


991 
zz i 


R R71 


1^ 09 


R ^17 
O.J 1 / 


1 no 


^7 79 

jZ. /Z 


89 


CD 


GT TT 


A 


991 

ZZ 1 


7 040 


1^ 01 


Q 1 10 
y. 1 iy 


i no 


^4 Sfi 


90 


0E1 


GLU 


A 


221 


8 047 


13 


10 363 


1 00 


36 22 


91 


0E2 


GLU 


A 


221 


7.127 


12.31 


8.498 


1.00 


36.44 


92 


C 


GLU 


A 


221 


9.301 


17.77 


8.056 


1.00 


25.89 


93 


0 


GLU 


A 


221 


8.54 


18.4 


7.329 


1.00 


25.26 


94 


N 


ALA 


A 


222 


9.929 


18.3 


9.099 


1.00 


23.75 


95 


CA 


ALA 


A 


222 
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It will be understood that various details of the invention may be changed without 
departing from the scope of the invention. Furthermore, the foregoing description is for the 
purpose of illustration only, and not for the purpose of limitation — the invention being 

■ 

defined by the claims. 
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CLAIMS 

What is claimed is: 

1. A substantially pure PPARa ligand binding domain polypeptide 
in crystalline form. 

2. The polypeptide of claim 1, wherein the crystalline form has 
lattice constants of a = 61.3 A, b = 103.5 A, c = 49.9 A, a = 90°, p = 90°, y = 
90°. 

3. The polypeptide of claim 1 or 2, wherein the crystalline form is 
an orthorhombic crystalline form. 

■ 

4. The polypeptide of claim 1 or 2, wherein the crystalline form has 
a space group of P2 1 2i2. 

5. The polypeptide of claim 1 or 2, wherein the PPARa ligand 
binding domain polypeptide has the amino acid sequence shown in SEQ ID 
NO: 4. 

6. The polypeptide of claim 1 or 2, wherein the PPARa ligand 
binding domain polypeptide is in complex with a ligand. 

7. The polypeptide of claim 1 or 2, wherein the PPARa ligand 
binding domain has a crystalline structure further characterized by the 
coordinates corresponding to Table 2. 

8. The polypeptide of claim 1 or 2, wherein the crystalline form 
contains one PPARa ligand binding domain polypeptide in the asymmetric 
unit. 

9. The polypeptide of claim 1 or 2, wherein the crystalline form is 
such that the three-dimensional structure of the crystallized PPARa ligand 
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binding domain polypeptide can be determined to a resolution of about 1.8 A 
or better. 

10. The polypeptide of claim 1 or 2, wherein the crystalline form 
contains one or more atoms having a molecular weight of 40 grams/mol or 
greater. 

11. The polypeptide of claim 1, wherein the crystalline form has 
lattice constants of a = 95.58 A, b = 122.06 A, c = 122.10 A, a = 90°, p = 90°, 
Y = 90°. 

12. The polypeptide of claim 1 or 11, wherein the crystalline form is 
an orthorhombic crystalline form. 

13. The polypeptide of claim 1 or 11, wherein the crystalline form 
has a space group of P2 1 2i2 i . 

14. The polypeptide of claim 1 or 11, wherein the PPARcc ligand 
binding domain polypeptide has the amino acid sequence shown in SEQ ID 
NO: 4. 

15. The polypeptide of claim 1 or 11, wherein the crystalline form 
contains four PPARa ligand binding domain polypeptides in the asymmetric 
unit. 

16. The polypeptide of claim 1 or 11, wherein the crystalline form is 
such that the three-dimensional structure of the crystallized PPARa ligand 
binding domain polypeptide can be determined to a resolution of about 2.5 A 
or better. 
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17. The polypeptide of claim 1 or 11, wherein the crystalline form 
contains one or more atoms having a molecular weight of 40 grams/mol or 
greater. 

18. A method for determining the three-dimensional structure of a 
crystallized PPARa ligand binding domain polypeptide to a resolution of about 
1 .8 A or better, the method comprising: 

(a) crystallizing a PPARa ligand binding domain polypeptide; and 

(b) analyzing the PPARa ligand binding domain polypeptide to 
determine the three-dimensional structure of the crystallized 
PPARa ligand binding domain polypeptide, whereby the three- 
dimensional structure of a crystallized PPARa ligand binding 
domain polypeptide is determined to a resolution of about 1 .8 A 
or better. 



19. The method of claim 18, wherein the analyzing is by X-ray 
diffraction. 



20. The method of claim 18, wherein the crystallization is 
accomplished by the hanging drop vapor diffusion method, and wherein the 
PPARa ligand binding domain is mixed with an equal volume of reservoir, 

21. The method of claim 20, wherein the reservoir comprises 4-8% 
PEG 3350, 100-200mM NaF, and 12-16% 2,5 hexanediol. 

22. The method of claim 20, wherein the reservoir comprises 50 mM 
bis-tris-propane, 4-6% PEG 3350, 150 mM NaN0 3 , 16% 2,5 hexanediol and 
1-3 mM YCI. 



23. A method of generating a crystallized PPARa ligand binding 
domain polypeptide, the method comprising: 
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(a) incubating a solution comprising a PPARa ligand binding 
domain with an equal volume of reservoir; and 

(b) crystallizing the PPARa ligand binding domain polypeptide using 
the hanging drop method, whereby a crystallized PPARa ligand 
binding domain polypeptide is generated. 

24. A crystallized PPARa ligand binding domain polypeptide 
produced by the method of claim 23. 

25. A method of designing a modulator of a PPAR polypeptide, the 
method comprising: 

(a) designing a potential modulator of a PPAR polypeptide that will 
make interactions with amino acids in the ligand binding site 
based upon a crystalline structure of a PPARa ligand binding 
domain polypeptide; 

(b) synthesizing the modulator; and 

(c) determining whether the potential modulator modulates the 
activity of the PPAR polypeptide, whereby a modulator of a 
PPAR polypeptide is designed. 

26. A method of designing a modulator that selectively modulates 
the activity of a PPAR polypeptide the method comprising: 

(a) obtaining a crystalline form of a PPARa ligand binding domain 
polypeptide; 

(b) evaluating the three-dimensional structure of the crystallized 
PPARa ligand binding domain polypeptide; and 

(c) synthesizing a potential modulator based on the three- 
dimensional crystal structure of the crystallized PPARa ligand 
binding domain polypeptide, whereby a modulator that 
selectively modulates the activity of a PPARa polypeptide is 
designed. 
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27. The method of claim 26, wherein the method further comprises 
contacting a PPARa ligand binding domain polypeptide with the potential 
modulator; and assaying the PPARa ligand binding domain polypeptide for 
binding of the potential modulator, for a change in activity of the PPARa ligand 
binding domain polypeptide, or both. 

28. The method of claim 26, wherein the crystalline form is in 
orthorhombic form. 

29. The method of claim 28, wherein the crystalline form is such that 
the three-dimensional structure of the crystallized PPARa ligand binding 
domain polypeptide can be determined to a resolution of about 1.8 A or better. 

30. A method of screening a plurality of compounds for a modulator 
of a PPAR ligand binding domain polypeptide, the method comprising: 

(a) providing a library of test samples; 

(b) contacting a crystalline PPARa ligand binding domain 
polypeptide with each test sample; 

(c) detecting an interaction between a test sample and the 
crystalline PPARa ligand binding domain polypeptide; 

(d) identifying a test sample that interacts with the crystalline 
PPARa ligand binding domain polypeptide; and 

(e) isolating a test sample that interacts with the crystalline PPARa 
ligand binding domain polypeptide, whereby a plurality of 
compounds is screened for a modulator of a PPAR ligand 
binding domain polypeptide. 

31. The method of claim 30, wherein the test samples are bound to 
a substrate. 

32. The method of claim 30, wherein the test samples are 
synthesized directly on a substrate. 
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33. A method for identifying a PPAR modulator, the method 
comprising: 

(a) providing atomic coordinates of a PPARct ligand binding domain 
to a computerized modeling system;and 

(b) modeling ligands that fit spatially into the binding pocket of the 
PPARa ligand binding domain to thereby identify a PPAR 
modulator, whereby a PPAR modulator is identified. 

34. The method of claim 33, wherein the method further comprises 
identifying in an assay for PPAR-mediated activity a modeled ligand which 
increases or decreases the activity of the PPAR. 

35. A method of identifying a PPARa modulator that selectively 
modulates the activity of a PPARa polypeptide compared to other 
polypeptides, the method comprising: 

(a) providing atomic coordinates of a PPARa ligand binding domain 
to a pomputerized modeling system; and 

(b) modeling a ligand that fits into the binding pocket of a PPARa 
ligand binding domain and that interacts with conformationally 
constrained residues of a PPARa conserved among PPAR 
subtypes, whereby a PPARa modulator that selectively 
modulates the activity of a PPARa polypeptide compared to 
other polypeptides. 

36. The method of claim 35, wherein the method further comprises 
identifying in a biological assay for PPARa activity a modeled ligand that 
selectively binds to said PPARa and increases or decreases the activity of 
said PPARa. 

37. A method of designing a modulator of a PPAR polypeptide, the 
method comprising: 

(a) selecting a candidate PPAR ligand; 
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(b) determining which amino acid or amino acids of a PPAR 
polypeptide interact with the ligand using a three-dimensional 
model of a crystallized protein comprising a PPARa LBD; 

(c) identifying in a biological assay for PPAR activity a degree to 
which the ligand modulates the activity of the PPAR polypeptide; 

(d) selecting a chemical modification of the ligand wherein the 
interaction between the amino acids of the PPAR polypeptide 
and the ligand is predicted to be modulated by the chemical, 
modification; 

(e) performing the chemical modification on the ligand to form a 
modified ligand; 

(f) contacting the modified ligand with the PPAR polypeptide; 

(g) identifying in a biological assay for PPAR activity a degree to 
which the modified ligand modulates the biological activity of the 
PPAR polypeptide; and 

(h) comparing the biological activity of the PPAR polypeptide in the 
presence of modified ligand with the biological activity of the 
PPAR polypeptide in the presence of the unmodified ligand, 
whereby a modulator of a PPAR polypeptide is designed. 

38. The method of claim 37, wherein the PPAR polypeptide is a 
PPARa polypeptide. 

39. The method of claim 37, wherein the three-dimensional model of 
a crystallized protein is a PPARa LBD polypeptide with a bound ligand. 

40. The method of claim 37, wherein the method further comprises 
repeating steps (a) through (f), if the biological activity of the PPAR 
polypeptide in the presence of the modified ligand varies from the biological 
activity of the PPAR polypeptide in the presence of the unmodified ligand. 
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41 . An assay method for identifying a compound that inhibits binding 
of a ligand to a PPAR polypeptide, the assay method comprising: 

(a) incubating a PPAR polypeptide with a ligand in the presence of 
a test inhibitor compound; 

(b) determining an amount of ligand that is bound to the PPAR 
polypeptide, wherein decreased binding of ligand to the PPAR 
protein in the presence of the test inhibitor compound relative to 
binding of ligand in the absence of the test inhibitor compound is 
indicative of inhibition; and 

(c) identifying the test compound as an inhibitor of ligand binding if 
decreased ligand binding is observed, whereby a compound that 
inhibits binding of a ligand to a PPAR polypeptide is identified. 

42. A method of identifying a PPAR modulator that selectively 
modulates the biological activity of one PPAR subtype compared to PPARa, 
the method comprising: 

(a) providing an atomic structure coordinate set describing a 
PPARa ligand binding domain structure and at least one other 
atomic structure coordinate set describing a PPAR ligand 
binding domain, each ligand binding domain comprising a ligand 
binding site; 

(b) comparing the PPAR atomic structure coordinate sets to identify 
at least one diference between the sets; 

(c) designing a candidate ligand predicted to interact with the 
difference of step (b); 

(d) synthesizing the candidate ligand; and 

(e) testing the synthesized candidate ligand for an ability to 

♦ 

selectively modulate a PPAR subtype as compared to PPARa, 
whereby a PPAR modulator that selectively modulates the 
biological activity of one PPAR subtype compared to PPARa is 
identified. 
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<212> DNA 
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<220> 
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<221> CDS 

<222> (124) . . (1527) 



<300> 

<308> Genbank/S7434 9 

<309> 1995-04-12 

<313> (1)..(1681) 



<400> 1 

gttctggagg ctgggaagtt caagatcaaa gtgccagcag attcagtgtc atgtgaggac 60 

gtgcttcctg cttcatagat aagagcttgg agctcggcgc acaaccagca ccatctggtc 120 

gcg atg gtg gac acg gaa age cca etc tgc ccc etc tec cca etc gag 168 
Met Val Asp Thr Glu Ser Pro Leu Cys Pro Leu Ser Pro Leu Glu 
15 10 15 

gee ggc gat eta gag age cog tta tct gaa gag ttc ctg caa gaa atg 216 
Ala Gly Asp Leu Glu Ser Pro Leu Ser Glu Glu Phe Leu Gin Glu Met 

20 25 30 

gga aac ate caa gag att teg caa tec ate ggc gag gat agt tct gga 264 
Gly Asn lie Gin Glu lie Ser Gin Ser lie Gly Glu Asp Ser Ser Gly 

35 40 45 

age ttt ggc ttt acg gaa tac cag tat tta gga age tgt cct ggc tea 312 
Ser Phe Gly Phe Thr Glu Tyr Gin Tyr Leu Gly Ser Cys Pro Gly Ser 
50 55 60 

gat ggc teg gtc ate acg gac acg ctt tea cca get teg age ccc tec 360 
Asp Gly Ser Val lie Thr Asp Thr Leu Ser Pro Ala Ser Ser Pro Ser 
65 70 75 



teg gtg act tat cct gtg gtc ccc ggc age gtg gac gag tct ccc agt 



408 
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Ser Val Thr Tyr Pro Val Val Pro Gly Ser Val Asp Glu Ser Pro Ser 
80 85 90 95 

gga gca ttg aac ate gaa tgt aga ate tgc ggg gac aag gee tea ggc 456 
Gly Ala Leu Asn lie Glu Cys Arg lie Cys Gly Asp Lys Ala Ser Gly 

100 105 110 

tat cat tac gga gtc cac gcg tgt gaa ggc tgc aag ggc ttc ttt egg 504 
Tyr His Tyr Gly Val His Ala Cys Glu Gly Cys Lys Gly Phe Phe Arg 

115 120 125 

cga acg att cga etc aag ctg gtg tat gac aag tgc gac cgc age tgc 552 
Arg Thr lie Arg Leu Lys Leu Val Tyr Asp Lys Cys Asp Arg Ser Cys 
130 135 140 

aag ate cag aaa aag aac aga aac aaa tgc cag tat tgt cga ttt cac 600 
Lys lie Gin Lys Lys Asn Arg Asn Lys Cys Gin Tyr Cys Arg Phe His 
145 150 155 

aag tgc ctt tct gtc ggg atg tea cac aac gcg att cgt ttt gga cga 648 
Lys Cys Leu Ser Val Gly Met Ser His Asn Ala lie Arg Phe Gly Arg 
160 165 170 175 

atg cca aga tct gag aaa gca aaa ctg aaa gca gaa att ctt ace tgt 696 
Met Pro Arg Ser Glu Lys Ala Lys Leu Lys Ala Glu lie Leu Thr Cys 

180 185 190 

» 

gaa cat gac ata gaa gat tct gaa act gca gat etc aaa tct ctg gee 744 
Glu His Asp lie Glu Asp Ser Glu Thr Ala Asp Leu Lys Ser Leu Ala 

195 200 205 

aag aga ate tac gag gee tac ttg aag aac ttc aac atg aac aag gtc 792 
Lys Arg lie Tyr Glu Ala Tyr Leu Lys Asn Phe Asn Met Asn Lys Val 
210 215 220 

aaa gee egg gtc ate etc tea gga aag gee agt aac aat cca cct ttt 840 
Lys Ala Arg Val lie Leu Ser Gly Lys Ala Ser Asn Asn Pro Pro Phe 
225 230 235 

gtc ata cat gat atg gag aca ctg tgt atg get gag aag acg ctg gtg 888 

Val He His Asp Met Glu Thr Leu Cys Met Ala Glu Lys Thr Leu Val 

240 245 250 255 

gee aag ctg gtg gee aat ggc ate cag aac aag gag gcg gag gtc cgc 936 

Ala Lys Leu Val Ala Asn Gly lie Gin Asn Lys Glu Ala Glu Val Arg 
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260 265 2.70 

ate ttt cac tgc tgc cag tgc acg tea gtg gag acc gtc acg gag etc 984 
He Phe His Cys Cys Gin Cys Thr Ser Val Glu Thr Val Thr Glu Leu 

275 280 285 

acg gaa ttc gee aag gee ate cca ggc ttc gca aac ttg gac ctg aac 1032 
Thr Glu Phe Ala Lys Ala lie Pro Gly Phe Ala Asn Leu Asp Leu Asn 
290 295 300 

gat caa gtg aca ttg eta aaa tac gga gtt tat gag gec ata ttc gec 1080 
Asp Gin Val Thr Leu Leu Lys Tyr Gly Val Tyr Glu Ala He Phe Ala 
305 310 315 

atg ctg tct tct gtg atg aac aaa gac ggg atg ctg gta gcg tat gga 1128 
Met Leu Ser Ser Val Met Asn Lys Asp Gly Met Leu Val Ala Tyr Gly 
320 325 330 335 

aat ggg ttt ata act cgt gaa ttc eta aaa age eta agg aaa ccg ttc 117 6 
Asn Gly Phe He Thr Arg Glu Phe Leu Lys Ser Leu Arg Lys Pro Phe 

340 345 350 

tgt gat ate atg gaa ccc aag ttt gat ttt gec atg aag ttc aat gca 1224 
Cys Asp He Met Glu Pro Lys Phe Asp Phe Ala Met Lys Phe Asn Ala 

355 360 365 

ctg gaa ctg gat gac agt gat ate tec ctt ttt gtg get get ate att 1272 
Leu Glu Leu Asp Asp Ser Asp He Ser Leu Phe Val Ala Ala He He 
370 375 380 



tgc tgt gga gat cgt cct 
Cys Cys Gly Asp Arg Pro 
385 

atg cag gag ggt att gta 
Met Gin Glu Gly He Val 
400 405 

cac ccg gac gat ate ttt 
His Pro Asp Asp He Phe 

420 

gac etc egg cag ctg gtg 
Asp Leu Arg Gin Leu Val 



ggc ctt eta aac gta gga 
Gly Leu Leu Asn Val Gly 
390 395 

cat gtg etc aga etc cac 
His Val Leu Arg Leu His 

410 

etc ttc cca aaa ctt ctt 
Leu Phe Pro Lys Leu Leu 

425 

acg gag cat gcg cag ctg 
Thr Glu His Ala Gin Leu 



cac att gaa aaa 1320 
His lie Glu Lys 

ctg cag age aac 1368 
Leu Gin Ser Asn 

415 

caa aaa atg gca 1416 
Gin Lys Met Ala 
430 

gtg cag ate ate 1464 
Val Gin He He 
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435 440 445 

aag aag acg gag teg gat get gcg ctg cac ccg eta ctg cag gag ate 1512 
Lys Lys Thr Glu Ser Asp Ala Ala Leu His Pro Leu Leu Gin Glu lie 

450 455 460 

tac agg gac atg tac tgagttcctt cagatcagcc acaccttttc caggagttct 1567 
Tyr Arg Asp Met Tyr 
. 465 

gaagctgaca gcactacaaa ggagacgggg gagcagcacg attttgeaca aatatccacc 1627 
actttaacct tagagcttgg acagtctgag ctgtaggtaa ceggcatatt attccatatc 1687 
tttgttttaa ccagtacttc taagagcata gaactcaaat gctg 1731 



<210> 2 

<211> 468 

<212> PRT 

<213> Homo sapiens 



<400> 2 

Met Val Asp Thr Glu Ser Pro Leu Cys Pro Leu Ser Pro Leu Glu Ala 
15 10 15 

Gly Asp Leu Glu Ser Pro Leu Ser Glu Glu Phe Leu Gin Glu Met Gly 

20 25 30 

Asn He Gin Glu He Ser Gin Ser He Gly Glu Asp Ser Ser Gly Ser 
35 40 45 

Phe Gly Phe Thr Glu Tyr Gin Tyr Leu Gly Ser Cys Pro Gly Ser Asp 
50 55 60 



Gly Ser Val He Thr Asp Thr Leu Ser Pro Ala Ser Ser Pro Ser Ser 
65 70 75 80 
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Val Thr Tyr Pro Val Val 

85 

Ala Leu Asn lie Glu Cys 

100 



His Tyr Gly Val His Ala 
115 

Thr lie Arg Leu Lys Leu 
130 

lie Gin Lys Lys Asn Arg 

145 150 

Cys Leu Ser Val Gly Met 

165 

Pro Arg Ser Glu Lys Ala 

180 

His Asp He Glu Asp Ser 
195 

Arg He Tyr Glu Ala Tyr 
210 

Ala Arg Val He Leu Ser 
225 230 

He His Asp Met Glu Thr 

245 

Lys Leu Val Ala Asn Gly 

260- 



-6- 

Pro Gly Ser Val Asp Glu 

90 

Arg He Cys Gly Asp Lys 
105 



Cys Glu Gly Cys Lys Gly 
120 

Val Tyr Asp Lys Cys Asp 
135 140 

Asn Lys Cys Gin Tyr Cys 

155 

Ser His Asn Ala He Arg 

170 

Lys Leu Lys Ala Glu He 
185 

Glu Thr Ala Asp Leu Lys 
200 

Leu Lys Asn' Phe Asn Met 
215 220 

Gly Lys Ala Ser Asn Asn 

235 

Leu Cys Met Ala Glu Lys 

250 

He Gin Asn Lys Glu Ala 
265 
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Ser Pro Ser Gly 
95 

Ala Ser Gly Tyr 
110 



Phe Phe Arg Arg 
125 

Arg Ser Cys Lys 



Arg Phe His Lys 

160 

Phe Gly Arg Met 
175 

Leu Thr Cys Glu 
190 

Ser Leu Ala Lys 
205 

Asn Lys Val Lys 



Pro Pro Phe Val 

240 

Thr Leu Val Ala 
255 

Glu Val Arg He 
270 



Phe His Cys Cys Gin Cys Thr Ser Val Glu Thr Val Thr Glu Leu Thr 
275 280 285 
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Glu Phe Ala Lys 
290 

Gin Val Thr Leu 
305 

Leu Ser Ser Val 



Gly Phe lie Thr 

340 

Asp lie Met Glu 
355 

Glu Leu Asp Asp 
370 

Cys Gly Asp Arg 
385 

Gin Glu Gly lie 



Pro Asp Asp lie 

420 

Leu Arg Gin Leu 
435 

Lys Thr Glu Ser 
450 

Arg Asp Met Tyr 
465 



Ala He Pro Gly 
295 

Leu Lys Tyr Gly 
310 

Met Asn Lys Asp 
325 

Arg Glu Phe Leu 



Pro Lys Phe Asp 

360 

Ser Asp He Ser 
375 

Pro Gly Leu Leu 
390 

Val His Val Leu 
405 

Phe Leu Phe Pro 



Val Thr Glu His 

440 

Asp Ala Ala Leu 
455 



Phe Ala Asn Leu 

300 

Val Tyr Glu Ala 
315 

Gly Met Leu Val 
330 

Lys Ser Leu Arg 
345 

Phe Ala Met Lys 



Leu Phe Val Ala 

380 

Asn Val Gly His 
395 

Arg Leu His Leu 
410 

Lys Leu Leu Gin 
425 

Ala Gin Leu Val 



His Pro Leu Leu 

460 



Asp Leu Asn Asp 



He Phe Ala Met 

320 

Ala Tyr Gly Asn 
335 

Lys Pro Phe Cys 
350 

Phe Asn Ala Leu 
365 

Ala He He Cys 



He Glu Lys Met 

400 

Gin Ser Asn His 
415 

Lys Met Ala Asp 
430 

Gin He He Lys 
445 

Gin Glu He Tyr 



<210> 3 
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<211> 867 



<212> DNA 



<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(867) 

<223> Residues 192-4 68, preceded by His tag 
<300> 

<308> Genbank/S74 34 9 

<309> 1995-04-12 

<313> (192) . . (468) 

<400> 3 

atg aaa aaa ggt cat cat cat cat cat cat ggt gaa cac gac ctg aaa 48 

Met Lys Lys Gly His His His His His His Gly Glu His Asp Leu Lys 
15 10 15 

gat teg gaa act gca gac etc aaa tct ctg ggc aag aga ate cac gaa 96 
Asp Ser Glu Thr Ala Asp Leu Lys Ser Leu Gly Lys Arg lie His Glu 



20 



25 



30 



gec tac ctg aag 
Ala Tyr Leu Lys 



aac ttc aac atg aac aag gtc aag gec egg gtc ata 
Asn Phe Asn Met Asn Lys Val Lys Ala Arg Val lie 



144 
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35 40 45 



etc gcg gga aag acc age aac aac ccg cct ttt gtc ata cat gac atg 192 
Leu Ala Gly Lys Thr Ser Asn Asn Pro Pro Phe Val lie His Asp Met 
50 55 60 

gag acc ttg tgt atg gec gag aag acg ctt gtg gec aag atg gtg gec 240 
Glu Thr Leu Cys Met Ala Glu Lys Thr Leu Val Ala Lys Met Val Ala 
65 70 75 80 

aac ggc gtc gaa gac aaa gag gca gag gtc cga ttc ttc cac tgc tgc 288 
Asn Gly Val Glu Asp Lys Glu Ala Glu Val Arg Phe Phe His Cys Cys 

85 90 95 

cag tgc atg tec gtg gag acc gtc acg gag etc aca gaa ttt gec aag 336 
Gin Cys Met Ser Val Glu Thr Val Thr Glu Leu Thr Glu Phe Ala Lys 

100 105 110 

get ate cca ggc ttt gca aac ttg gac ttg aac gac caa gtc acc ttg 384 
Ala He Pro Gly Phe Ala Asn Leu Asp Leu Asn Asp Gin Val Thr Leu 
115 120 125 

eta aag tac ggt gtg tat gaa gee ate ttc acg atg ctg tec tec ttg 432 

Leu Lys Tyr Gly Val Tyr Glu Ala He Phe Thr Met Leu Ser Ser Leu 
130 135 140 

4 

atg aac aaa gac ggg atg ctg ate gcg tac ggc aat ggc ttt ate aca 480 

Met Asn Lys Asp Gly Met Leu He Ala Tyr Gly Asn Gly Phe He Thr 
145 150 155 160 

cgc gag ttc ctt aag aac ctg agg aag ccg ttc tgt gac ate atg gaa 528 
Arg Glu Phe Leu Lys Asn Leu Arg Lys Pro Phe Cys Asp He Met Glu 

165 170 175 

ccc aag ttt gac ttc get atg aag ttc aat gec tta gaa ctg gat gac 576 
Pro Lys Phe Asp Phe Ala Met Lys Phe Asn Ala Leu Glu Leu Asp Asp 

180 185 190 

agt gac att tec ctg ttt gtg get get ata att tgc tgt gga gat egg 624 
Ser Asp He Ser Leu Phe Val Ala Ala He He Cys Cys Gly Asp Arg 
195 200 205 



cct ggc ctt eta aac ata ggc tac att gag aag ttg cag gag ggg att 



672 
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Pro Gly Leu Leu Asn lie Gly Tyr lie Glu Lys Leu Gin Glu Gly lie 
210 215 220 

gtg cac gtg ctt aag etc cac ctg cag age aac cat cca gat gac acc 720 

Val His Val Leu Lys Leu His Leu Gin Ser Asn His Pro Asp Asp Thr 
225 230 235 240 

ttc etc ttc cca aag etc ctt caa aaa atg gtg gac ctt egg cag ctg 768 

Phe Leu Phe Pro Lys Leu Leu Gin Lys Met Val Asp Leu Arg Gin Leu 

245 250 255 



gtc acg gag cat gcg cag etc gta cag gtc ate aag aag acc gag tec 816 
Val Thr Glu His Ala Gin Leu Val Gin Val lie Lys Lys Thr Glu Ser 

260 265 270 

gac gca gcg ctg cac cca ctg ttg caa gag ate tac aga gac atg tac 864 
Asp Ala Ala Leu His Pro Leu Leu Gin Glu lie Tyr Arg Asp Met Tyr 
275 280 285 

taa 867 



<210> 4 
<211> 288 
<212> PRT 



<213> Homo sapiens 



<400> 4 

Met Lys Lys Gly 
1 

Asp Ser Glu Thr 

20 

Ala Tyr Leu Lys 
35 



His His His His 
5 

Ala Asp Leu Lys 

Asn Phe Asn Met 

40 



His His Gly Glu 
10 

Ser Leu Gly Lys 
25 

Asn Lys Val Lys 



His Asp Leu Lys 
15 

Arg He His Glu 
30 

Ala Arg Val He 
45 
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Leu Ala Gly Lys Thr Ser Asn Asn Pro Pro Phe Val lie His Asp Met 
50 55 60 

Glu Thr Leu Cys Met Ala Glu Lys Thr Leu Val Ala Lys Met Val Ala 
65 70 75 80 

Asn Gly Val Glu Asp Lys Glu Ala Glu Val Arg Phe Phe His Cys Cys 

85 90 95 

Gin Cys Met Ser Val Glu Thr Val Thr Glu Leu Thr Glu Phe Ala Lys 

100 105 110 

Ala lie Pro Gly Phe Ala Asn Leu Asp Leu Asn Asp Gin Val Thr Leu 

115 120 125 

Leu Lys Tyr Gly Val Tyr Glu Ala lie Phe Thr Met Leu Ser Ser Leu 
130 135 , 140 

Met Asn Lys Asp Gly Met Leu He Ala Tyr Gly Asn Gly Phe He Thr 
145 150 155 160 

Arg Glu Phe Leu Lys Asn Leu Arg Lys Pro Phe Cys Asp He Met Glu 

165 170 175 

Pro Lys Phe Asp Phe Ala Met Lys Phe Asn Ala Leu Glu Leu Asp Asp 

180 185 190 

Ser Asp He Ser Leu Phe Val Ala Ala He He Cys Cys Gly Asp Arg 
195 200 205 

Pro Gly Leu Leu Asn He Gly Tyr He Glu Lys Leu Gin Glu Gly He 
210 215 220 

Val His Val Leu Lys Leu His Leu Gin Ser Asn His Pro Asp Asp Thr 
225 230 235 240 



Phe Leu Phe Pro Lys Leu Leu Gin Lys Met Val Asp Leu Arg Gin Leu 

245 250 255 

Val Thr Glu His Ala Gin Leu Val Gin Val He Lys Lys Thr Glu Ser 

260 265 270 
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Asp Ala Ala Leu His Pro Leu Leu Gin Glu He Tyr Arg Asp Met Tyr 
275 280 285 

<210> 5 

<211> 2961 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (338) . . (1660) 
<300> 

<308> Genbank/L07592 

<309> 1993-04-27 

<313> (338) . . (1660) 



<400> 5 

gaattctgcg gagcctgcgg gacggcggcg ggttggcccg taggcagccg ggacagtgtt 60 

gtacagtgtt ttgggcatgc acgtgatact cacacagtgg cttctgctca ccaacagatg 120 

aagacagatg caccaacgag ggtctggaat ggtctggagt ggtctggaaa gcagggtcag 180 

atacccctgg aaaactgaag cccgtggagc aatgatctct acaggactgc ttcaaggctg 240 
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atgggaacca ccctgtagag gtccatctgc gttcagaccc agacgatgcc agagctatga 300 

ctgggcctgc aggtgtggcg ccgaggggag atcagcc atg gag cag cca cag gag 355 

Met Glu Gin Pro Gin Glu 
1 5 

gaa gcc cct gag gtc egg gaa gag gag gag aaa gag gaa gtg gca gag 403 
Glu Ala Pro Glu Val Arg Glu Glu Glu Glu Lys Glu Glu Val Ala Glu 

10 15 20 

gca gaa gga gcc cca gag etc aat ggg gga cca cag cat gca ctt cct 451 
Ala Glu Gly Ala Pro Glu Leu Asn Gly Gly Pro Gin His Ala Leu Pro 
25 30 35 

tec age age tac aca gac etc tec egg age tec teg cca ccc tea ctg 4 99 

Ser Ser Ser Tyr Thr Asp Leu Ser Arg Ser Ser Ser Pro Pro Ser Leu 
40 45 50 

ctg gac caa ctg cag atg ggc tgt gac ggg gcc tea tgc ggc age etc 547 
Leu Asp Gin Leu Gin Met Gly Cys Asp Gly Ala Ser Cys Gly Ser Leu 
55 60 65 70 

aac atg gag tgc egg gtg tgc ggg gac aag gca teg ggc ttc cac tac 595 
Asn Met Glu Cys Arg Val Cys Gly Asp Lys Ala Ser Gly Phe His Tyr 

75 80 85 

ggt gtt cat gca tgt gag ggg tgc aag ggc ttc ttc cgt cgt acg ate 643 
Gly Val His Ala Cys Glu Gly Cys Lys Gly Phe Phe Arg Arg Thr lie 

90 95 100 



cgc atg aag ctg gag tac 
Arg Met Lys Leu Glu Tyr 
105 

aag aag aac cgc aac aag 
Lys Lys Asn Arg Asn Lys 
120 

gca ctg ggc atg tea cac 
Ala Leu Gly Met Ser His 
135 140 
get gag aag agg aag ctg 
Ala Glu Lys Arg Lys Leu 

155 



gag aag tgt gag cgc age 
Glu Lys Cys Glu Arg Ser 
110 

tgc cag tac tgc cgc ttc 
Cys Gin Tyr Cys Arg Phe 
125 130 

aac get ate cgt ttt ggt 
Asn Ala lie Arg Phe Gly 

145 

gtg gca ggg ctg act gca 
Val Ala Gly Leu Thr Ala 

160 



tgc aag att cag 691 

Cys Lys lie Gin 

115 

cag aag tgc ctg 739 
Gin Lys Cys Leu 

egg atg ccg gag 787 
Arg Met Pro Glu 

150 

aac gag ggg age 835 
Asn Glu Gly Ser 
165 
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cag tac aac cca cag gtg gcc gac ctg aag gcc ttc tec aag cac ate 883 

Gin Tyr Asn Pro Gin Val Ala Asp Leu Lys Ala Phe Ser Lys His lie 

170 175 180 

tac aat gcc tac ctg aaa aac ttc aac atg ace aaa aag aag gcc cgc 931 

Tyr Asn Ala Tyr Leu Lys Asn Phe Asn Met Thr Lys Lys Lys Ala Arg 
185 190 195 

age ate etc ace ggc aaa gcc age cac acg gcg ccc ttt gtg ate cac 979 

Ser He Leu Thr Gly Lys Ala Ser His Thr Ala Pro Phe Val He His 
200 205 210 

gac ate gag aca ttg tgg cag gca gag aag ggg ctg gtg tgg aag cag 1027 

Asp He Glu Thr Leu Trp Gin Ala Glu Lys Gly Leu Val Trp Lys Gin 
215 220 225 230 

ttg gtg aat ggc ctg cct ccc tac aag gag ate age gtg cac gtc ttc 1075 

Leu Val Asn Gly Leu Pro Pro Tyr Lys Glu He Ser Val His Val Phe 

235 240 245 

tac cgc tgc cag tgc acc aca gtg gag ace gtg egg gag etc act gag 1123 

Tyr Arg Cys Gin Cys Thr Thr Val Glu Thr Val Arg Glu Leu Thr Glu 

250 255 260 

ttc gcc aag age ate ccc age ttc age age etc ttc etc aac gac cag 1171 

Phe Ala Lys Ser He Pro Ser Phe Ser Ser Leu Phe Leu Asn Asp Gin 
265 270 275 

gtt acc ctt etc aag tat ggc gtg cac gag gcc ate ttc gcc atg ctg 1219 

Val Thr Leu Leu Lys Tyr Gly Val His Glu Ala He Phe Ala Met Leu 
280 285 290 

gcc tct ate gtc aac aag gac ggg ctg ctg gta gcc aac ggc agt ggc 1267 

Ala Ser He Val Asn Lys Asp Gly Leu Leu Val Ala Asn Gly Ser Gly 
295 300 305 310 

ttt gtc acc cgt gag ttc ctg cgc age etc cgc aaa ccc ttc agt gat 1315 

Phe Val Thr Arg Glu Phe Leu Arg Ser Leu Arg Lys Pro Phe Ser Asp 

315 320 325 



ate att gag cct aag ttt gaa ttt get gtc aag ttc aac gcc ctg gaa 
He He Glu Pro Lys Phe Glu Phe Ala Val Lys Phe Asn Ala Leu Glu 



1363 
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330 335 340 

ctt gat gac agt gac ctg gcc eta ttc att gcg gec ate att ctg tgt 1411 
Leu Asp Asp Ser Asp Leu Ala Leu Phe lie Ala Ala lie He Leu Cys 
345 350 355 

gga gac egg cca ggc etc atg aac gtt cca egg gtg gag get ate cag 1459 
Gly Asp Arg Pro Gly Leu Met Asn Val Pro Arg Val Glu Ala He Gin 
360 365 370 

gac acc ate ctg cgt gcc etc gaa ttc cac ctg cag gcc aac cac cct 1507 
Asp Thr He Leu Arg Ala Leu Glu Phe His Leu Gin Ala Asn His Pro 
375 380 385 390 

gat gcc cag tac etc ttc ccc aag ctg ctg cag aag atg get gac ctg 1555 
Asp Ala Gin Tyr Leu Phe Pro Lys Leu Leu Gin Lys Met Ala Asp Leu 

395 400 405 

egg caa ctg gtc acc gag cac gcc cag atg atg cag egg ate aag aag 1603 
Arg Gin Leu Val Thr Glu His Ala Gin Met Met Gin Arg He Lys Lys 

410 415 420 

acc gaa acc gag acc teg ctg cac cct ctg etc cag gag ate tac aag 1651 
Thr Glu Thr Glu Thr Ser Leu His Pro Leu Leu Gin Glu He Tyr Lys 
425 430 435 

gac atg tac taaeggegge acccaggcct ccctgcagac tccaatgggg 1700 
Asp Met Tyr 
440 

ccagcactgg aggggcccac ccacatgact tttccattga ccagctctct tcctgtcttt 1760 

gttgtctccc tctttctcag ttcctctttc ttttctaatt cctgttgctc tgtttcttcc 1820 

tttctgtagg tttctctctt cccttctccc ttctcccttg ccctcccttt ctctctccta 1880 

tccccacgtc tgtcctcctt tcttattctg tgagatgttt tgtattattt caccagcagc 1940 

atagaacagg acctctgctt ttgcacacct tttccccagg agcagaagag agtgggcctg 2000 

ccctctgccc catcattgea cctgcaggct taggtcctca cttctgtctc ctgtcttcag 2060 

agcaaaagac ttgagecate caaagaaaca ctaagctctc tgggcctggg ttccagggaa 2120 
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ggctaagcat ggcctggact gactgcagcc ccctatagtc atggggtccc tgctgcaaag 2180 

gacagtggca gaccccggca gtagagccga gatgcctccc caagactgtc attgcccctc 2240 

cgatcgtgag gccacccact gacccaatga tcctctccag cagcacacct cagccccact 2300 

gacacccagt gtccttccat cttcacactg gtttgccagg ccaatgttgc tgatggcccc 2360 

tccagcacac acacataagc actgaaatca ctttacctgc aggcaccatg cacctccctt 2420 

ccctccctga ggcaggtgag aacccagaga gaggggcctg caggtgagca ggcagggctg 2480 

* 

ggccaggtct ccggggaggc aggggtcctg caggtcctgg tgggtcagcc cagcacctcg 2540 

cccagtggga gcttcccggg ataaactgag cctgttcatt ctgatgtcca tttgtcccaa 2600 

tagctctact gccctcccct tcccctttac tcagcccagc tggccaccta gaagtctccc 2660 

tgcacagcct ctagtgtccg gggaccttgt gggaccagtc ccacaccgct ggtccctgcc 2720 

ctcccctgct cccaggttga ggtgcgctca cctcagagca gggccaaagc acagctgggc 2780 

atgccatgtc tgagcggcgc agagccctcc aggcctgcag gggcaagggg ctggctggag 284 0 

tctcagagca cagaggtagg agaactgggg ttcaagccca ggcttcctgg gtcctgcctg 2900 

gtcctccctc ccaaggagcc attctatgtg actctgggtg gaagtgccca gcccctgcct 2960 

g 2961 



<210> 6 



<211> • 441 



<212> PRT 



<213> Homo sapiens 



<400> 6 

Met Glu Gin Pro Gin Glu Glu Ala Pro Glu Val Arg Glu Glu Glu Glu 
15 10 15 
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Lys Glu Glu Val Ala 

20 

Pro Gin His Ala Leu 
35 

Ser Ser Pro Pro Ser 
50 

Ala Ser Cys Gly Ser 
65 

Ala Ser Gly Phe His 

85 

Phe Phe Arg Arg Thr 

100 

Arg Ser Cys Lys lie 
115 

Arg Phe Gin Lys Cys 
130 



Phe Gly Arg Met Pro 
145 

Thr Ala Asn Glu Gly 

165 

Ala Phe Ser Lys His 

180 

Thr Lys Lys Lys Ala 
195 

Ala Pro Phe Val He 
210 



Glu Ala Glu Gly Ala Pro 

25 

Pro Ser Ser Ser Tyr Thr 
40 

Leu Leu Asp Gin Leu Gin 
55 

Leu Asn Met Glu Cys Arg 
70 75 

Tyr Gly Val His Ala Cys 

90 

He Arg Met Lys Leu Glu 

105 

Gin Lys Lys Asn Arg Asn 
120 

Leu Ala Leu Gly Met Ser 
135 



Glu Ala Glu Lys Arg Lys 
150 155 

Ser Gin Tyr Asn Pro Gin 

170 

He Tyr Asn Ala Tyr Leu 

185 

Arg Ser He Leu Thr Gly 
200 

His Asp He Glu Thr Leu 
215 



Glu Leu Asn Gly Gly 
30 

Asp Leu Ser Arg Ser 
45 

Met Gly Cys Asp Gly 
60 

Val Cys Gly Asp Lys 

80 

Glu Gly Cys Lys Gly 

95 

Tyr Glu Lys Cys Glu 
110 

Lys Cys Gin Tyr Cys 
125 

His Asn Ala lie Arg 
140 



Leu Val Ala Gly Leu 

160 

Val Ala Asp Leu Lys 

175 

Lys Asn Phe Asn Met 
190 

Lys Ala Ser His Thr 
205 

Trp Gin Ala Glu Lys 
220 
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Gly Leu Val Trp 
225 

He Ser Val His 



Val Arg Glu Leu 

260 

Leu Phe Leu Asn 
275 

Ala He Phe Ala 
290 

Val Ala Asn Gly 
305 

Arg Lys Pro Phe 



Lys Phe Asn Ala 

340 

Ala Ala He He 
355 

Arg Val Glu Ala 
370 



Leu Gin Ala Asn 
385 



Gin Lys Met Ala 



Lys Gin Leu Val 
230 

Val Phe Tyr Arg 
245 

Thr Glu Phe Ala 



Asp Gin Val Thr 

280 

Met Leu Ala Ser 
295 

Ser Gly Phe Val 
310 

Ser Asp He He 
325 



Leu Glu Leu Asp 



Leu Cys Gly Asp 

360 

» 

He Gin Asp Thr 
375 



His Pro Asp Ala 
390 



Asp Leu Arg Gin 
405 



Asn Gly Leu Pro 
235 

Cys Gin Cys Thr 
250 

Lys Ser He Pro 
265 

Leu Leu Lys Tyr 



He Val Asn Lys 

300 

Thr Arg Glu Phe 
315 

Glu Pro Lys Phe 
330 



Asp Ser Asp Leu 
345 

Arg Pro Gly Leu 



He Leu Arg Ala 

380 



Gin Tyr Leu Phe 
395 



Leu Val Thr Glu 
410 



Pro Tyr Lys Glu 

240 

Thr Val Glu Thr 
255 

Ser Phe Ser Ser 
270 

Gly Val His Glu 
285 

Asp Gly Leu Leu 



Leu Arg Ser Leu 

320 

Glu Phe Ala Val 
335 



Ala Leu Phe He 
350 

Met Asn Val Pro 
365 

Leu Glu Phe His 



Pro Lys Leu Leu 

400 



His Ala Gin Met 
415 
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Met Gin Arg lie Lys Lys Thr Glu Thr Glu 

420 425 

Leu Gin Glu He Tyr Lys Asp Met Tyr 
435 440 

<210> 7 

<211> 825 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1)..(825) 
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Thr Ser Leu His Pro Leu 

430 



<400> 7 



cag tac aac cca cag gtg gcc gac ctg aag gcc ttc tec aag cac ate 48 
Gin Tyr Asn Pro Gin Val Ala Asp Leu Lys Ala Phe Ser Lys His lie 
15 10 15 



tac aat gcc tac ctg aaa aac ttc aac atg acc aaa aag aag gcc cgc 96 
Tyr Asn Ala Tyr Leu Lys Asn Phe Asn Met Thr Lys Lys Lys Ala Arg 

20 25 30 



age ate etc acc ggc aaa gcc age cac acg gcg ccc ttt gtg ate cac 14 4 

Ser He Leu Thr Gly Lys Ala Ser His Thr Ala Pro Phe Val He His 
35 40 45 



gac ate gag aca 
Asp He Glu Thr 



ttg tgg cag gca gag aag ggg ctg gtg tgg aag cag 
Leu Trp Gin Ala Glu Lys Gly Leu Val Trp Lys Gin 



192 
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50 55 60 

ttg gtg aat ggc ctg cct ccc tac aag gag ate age gtg cac gtc ttc 
Leu Val Asn Gly Leu Pro Pro Tyr Lys Glu lie Ser Val His Val Phe 
65 70 75 80 

tac cgc tgc cag tgc acc aca gtg gag acc gtg egg gag etc act gag 
Tyr Arg Cys Gin Cys Thr Thr Val Glu Thr Val Arg Glu Leu Thr Glu 

85 90 95 

ttc gec aag age ate ccc age ttc age age etc ttc etc aac gac cag 

Phe Ala Lys Ser lie Pro Ser Phe Ser Ser Leu Phe Leu Asn Asp Gin 

100 105 110 

* 

gtt acc ctt etc aag tat ggc gtg cac gag gee ate ttc gee atg ctg 

Val Thr Leu Leu Lys Tyr Gly Val His Glu Ala lie Phe Ala Met Leu 

115 120 125 

gee tct ate gtc aac aag gac ggg ctg ctg gta gee aac ggc agt ggc 
Ala Ser lie Val Asn Lys Asp Gly Leu Leu Val Ala Asn Gly Ser Gly 
130 135 140 

ttt gtc acc cgt gag ttc ctg cgc age etc cgc aaa ccc ttc agt gat 
Phe Val Thr Arg Glu Phe Leu Arg Ser Leu Arg Lys Pro Phe Ser Asp 
145 150 155 160 

ate att gag cct aag ttt gaa ttt get gtc aag ttc aac gee ctg gaa 
He He Glu Pro Lys Phe Glu Phe Ala Val Lys Phe Asn Ala Leu Glu 

165 170 175 

ctt gat gac agt gac ctg gee eta ttc att gcg gee ate att ctg tgt 
Leu Asp Asp Ser Asp Leu Ala Leu Phe He Ala Ala He He Leu Cys 

180 185 190 



gga gac egg cca 
Gly Asp Arg Pro 
195 

gac acc ate ctg 
Asp Thr He Leu 
210 

gat gee cag tac 
Asp Ala Gin Tyr 
225 



ggc etc atg aac 
Gly Leu Met Asn 

200 

cgt gee etc gaa 
Arg Ala Leu Glu 
215 

etc ttc ccc aag 
Leu Phe Pro Lys 
230 



gtt cca egg gtg 
Val Pro Arg Val 

ttc cac ctg cag 
Phe His Leu Gin 

220 

ctg ctg cag aag 
Leu Leu Gin Lys 
235 



gag get ate cag 
Glu Ala He Gin 
205 

gee aac cac cct 
Ala Asn His Pro 

atg get gac ctg 
Met Ala Asp Leu 

240 
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240 



288 



336 



384 



432 



480 



528 



576 



624 



672 



720 
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egg caa ctg gtc acc gag cac gec cag atg atg cag egg ate aag aag 768 
Arg Gin Leu Val Thr Glu His Ala Gin Met Met Gin Arg lie Lys Lys 

245 250 255 

acc gaa acc gag acc teg ctg cac cct ctg etc cag gag ate tac aag 816 
Thr Glu Thr Glu Thr Ser Leu His Pro Leu Leu Gin Glu lie Tyr Lys 

260 265 270 

gac atg tac 825 
Asp Met Tyr 
275 

<210> 8 
<211> 275 
<212> PRT 



<213> Homo sapiens 



<400> 8 

Gin Tyr Asn Pro 

1 

Tyr Asn Ala Tyr 

20 



Gin Val Ala Asp 
5 

Leu Lys Asn Phe 



Leu Lys Ala Phe 
10 

Asn Met Thr Lys 
25 



Ser Lys His lie 
15 

Lys Lys Ala Arg 
30 



Ser He Leu Thr Gly Lys Ala Ser His Thr Ala Pro Phe Val He His 
35 40 45 

Asp He Glu Thr Leu Trp Gin Ala Glu Lys Gly Leu Val Trp Lys Gin 
50 55 60 



Leu Val Asn Gly Leu Pro Pro Tyr Lys Glu He Ser Val His Val Phe 
65 70 75 80 
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Tyr Arg Cys Gin Cys Thr 

85 

Phe Ala Lys Ser He Pro' 

100 

Val Thr Leu Leu Lys Tyr 
115 

Ala Ser He Val Asn Lys 
130 

Phe Val Thr Arg Glu Phe 
145 150 

He He Glu Pro Lys Phe 

165 

Leu Asp Asp Ser Asp Leu 

180 

Gly Asp Arg Pro Gly Leu 
195 

Asp Thr He Leu Arg Ala 
210 

Asp Ala Gin Tyr Leu Phe 
225 230 



-22- 

Thr Val Glu Thr Val Arg 

90 

Ser Phe Ser Ser Leu Phe 
105 

Gly Val His Glu Ala He 
120 

Asp Gly Leu Leu Val Ala 
135 140 

Leu Arg Ser Leu Arg Lys 

155 

Glu Phe Ala Val Lys Phe 

170 

Ala Leu Phe He Ala Ala 
185 

Met Asn Val Pro Arg Val 
200 

Leu Glu Phe His Leu Gin 
215 220 

Pro Lys Leu Leu Gin Lys 

235 
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Glu Leu Thr Glu 
95 

Leu Asn Asp Gin 
110 

Phe Ala Met Leu 
125 

Asn Gly Ser Gly 



Pro Phe Ser Asp 

160 

Asn Ala Leu Glu 
175 

He lie Leu Cys 
190 

Glu Ala lie Gin 
205 

Ala Asn His Pro 



Met Ala Asp Leu 

240 



Arg Gin Leu Val Thr Glu His Ala Gin Met Met Gin Arg lie Lys Lys 

245 250 255 

Thr Glu Thr Glu Thr Ser Leu His Pro Leu Leu Gin Glu lie Tyr Lys 

260 265 270 

Asp Met Tyr 
275 
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<210> 9 
<211> 11 
<212> PRT 

<213> Artificial Purification Sequence 
<400> 9 

Met Lys Lys Gly His His His His His His Gly 
15 10 
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